DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry Paper • 2512.11558 • Published 17 days ago • 41
CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling Paper • 2510.04204 • Published Oct 5 • 20
CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling Paper • 2510.04204 • Published Oct 5 • 20 • 2
CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling Paper • 2510.04204 • Published Oct 5 • 20
ORLM: Training Large Language Models for Optimization Modeling Paper • 2405.17743 • Published May 28, 2024 • 3
TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets Paper • 2502.01506 • Published Feb 3 • 38
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published Jan 24 • 27
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published Jan 24 • 27
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published Jan 24 • 27 • 2
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 85