CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling Paper • 2510.04204 • Published Oct 5 • 19
ORLM: Training Large Language Models for Optimization Modeling Paper • 2405.17743 • Published May 28, 2024 • 3
Soundwave: Less is More for Speech-Text Alignment in LLMs Paper • 2502.12900 • Published Feb 18 • 85
TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets Paper • 2502.01506 • Published Feb 3 • 38
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published Jan 24 • 33
Enabling Scalable Oversight via Self-Evolving Critic Paper • 2501.05727 • Published Jan 10 • 75
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 84