When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents Paper • 2606.05806 • Published 22 days ago • 23
MERIT: Matching Expertise via Rubric-Informed Training for Reviewer Assignment Paper • 2605.27865 • Published 30 days ago • 1
Retrieval, Reward, and Training Protocols: What Matters in Training Search Agents? Paper • 2605.27881 • Published 30 days ago • 10
Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning Paper • 2605.28424 • Published 30 days ago • 32