IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs Paper • 2604.10539 • Published Apr 12 • 3
Accelerating Speculative Decoding with Block Diffusion Draft Trees Paper • 2604.12989 • Published about 1 month ago • 8
PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing Paper • 2604.05018 • Published Apr 6 • 3
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU Paper • 2604.05091 • Published Apr 6 • 46
INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling Paper • 2604.07209 • Published Apr 8 • 38
A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens Paper • 2604.04913 • Published Apr 6 • 12
Shepherd: A Runtime Substrate Empowering Meta-Agents with a Formalized Execution Trace Paper • 2605.10913 • Published 3 days ago • 1
Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient? Paper • 2605.10848 • Published 3 days ago • 3
AI Co-Mathematician: Accelerating Mathematicians with Agentic AI Paper • 2605.06651 • Published 7 days ago • 14
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Paper • 2603.13594 • Published Mar 13 • 149
A^2RD: Agentic Autoregressive Diffusion for Long Video Consistency Paper • 2605.06924 • Published 7 days ago • 15