The Extrapolation Cliff in On-Policy Distillation of Near-Deterministic Structured Outputs Paper • 2605.08737 • Published 11 days ago • 3
The Extrapolation Cliff in On-Policy Distillation of Near-Deterministic Structured Outputs Paper • 2605.08737 • Published 11 days ago • 3
The Extrapolation Cliff in On-Policy Distillation of Near-Deterministic Structured Outputs Paper • 2605.08737 • Published 11 days ago • 3
Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities Paper • 2601.21937 • Published Jan 29 • 19