Moozonian

About 0 results
AI Overview
Generating...
Sponsored • AdSense Integration Active
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › ...n_phrases_%28full%29
List of Latin phrases (full) - Wikipedia
[The course of life]. In Eberle, Joseph [in German] (ed.). Viva Camena: Latina huius aetatis carmina [Viva the Muse: Contemporary Latin poems]. Zurich
arxiv.org arXiv
arxiv.org › abs › 2312.08935v3
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
In this paper, we present an innovative process-oriented math process reward model called \textbf{Math-Shepherd}, which assigns a reward score to each step of math problem solutions. The training of M...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › 2004_in_music
2004 in music - Wikipedia
"California Dreamin'" – Royal Gigolos "Call On Me" – Eric Prydz "(Can't Get My) Head Around You" – The Offspring "Caught Up" – Usher "Cer...
arxiv.org arXiv
arxiv.org › abs › 2406.18629v1
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
Mathematical reasoning presents a significant challenge for Large Language Models (LLMs) due to the extensive and precise chain of reasoning required for accuracy. Ensuring the correctness of each rea...
arxiv.org arXiv
arxiv.org › abs › 2602.10604v2
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
We introduce Step 3.5 Flash, a sparse Mixture-of-Experts (MoE) model that bridges frontier-level agentic intelligence and computational efficiency. We focus on what matters most when building agents: ...
arxiv.org arXiv
arxiv.org › abs › 2502.08941v3
Analysis of Off-Policy $n$-Step TD-Learning with Linear Function Approximation
This paper analyzes multi-step temporal difference (TD)-learning algorithms within the ``deadly triad'' scenario, characterized by linear function approximation, off-policy learning, and bootstrapping...
arxiv.org arXiv
arxiv.org › abs › 2507.16632v3
Step-Audio 2 Technical Report
This paper presents Step-Audio 2, an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation. By integrating a latent audio encoder and r...
arxiv.org arXiv
arxiv.org › abs › 1708.00023v2
Two-step approach to scheduling quantum circuits
As the effort to scale up existing quantum hardware proceeds, it becomes necessary to schedule quantum gates in a way that minimizes the number of operations. There are three constraints that have to ...
arxiv.org arXiv
arxiv.org › abs › 2511.18834v1
FlowSteer: Guiding Few-Step Image Synthesis with Authentic Trajectories
With the success of flow matching in visual generation, sampling efficiency remains a critical bottleneck for its practical application. Among flow models' accelerating methods, ReFlow has been someho...
arxiv.org arXiv
arxiv.org › abs › 2008.01155v2
Diffusion Limit of Poisson Limit-Order Book Models
This ia a companion paper to Almost, Lehoczky, Shreve & Yu \cite{ALSY}, where the rationale for studying the diffusion limit of Poisson limit-order book models is explained and the results of a partic...
arxiv.org arXiv
arxiv.org › abs › 1608.00613v2
Skipping Selected Steps of DWT Computation in Lossless JPEG 2000 for Improved Bitrates
In order to improve bitrates of lossless JPEG 2000, we propose to modify the discrete wavelet transform (DWT) by skipping selected steps of its computation. We employ a heuristic to construct the skip...
arxiv.org arXiv
arxiv.org › abs › 2010.09575v2
Burton-Cabrera-Frank theory for surfaces with alternating step types
Burton-Cabrera-Frank (BCF) theory has proven to be a versatile framework to relate surface morphology and dynamics during crystal growth to the underlying mechanisms of adatom diffusion and attachment...