AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving Paper • 2508.09889 • Published 11 days ago • 32
INTIMA: A Benchmark for Human-AI Companionship Behavior Paper • 2508.09998 • Published 20 days ago • 2
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction Paper • 2508.11987 • Published 8 days ago • 57
AFM-Models Collection The models and training dataset of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 12 items • Updated 18 days ago • 13
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Paper • 2508.13167 • Published 18 days ago • 104
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion Paper • 2508.04440 • Published 18 days ago • 9
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • 17 days ago • 69
Ovis2.5 Collection Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated 5 days ago • 52
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published 10 days ago • 135
CompassVerifier Collection CompassVerifier: A Unified and Robust Verifier for Large Language Models • 5 items • Updated 17 days ago • 5
view article Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 20 days ago • 27