LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published 5 days ago • 69
Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models Paper • 2603.22212 • Published 3 days ago • 118
Video-CoE: Reinforcing Video Event Prediction via Chain of Events Paper • 2603.14935 • Published 10 days ago • 90
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published 23 days ago • 144
V_1: Unifying Generation and Self-Verification for Parallel Reasoners Paper • 2603.04304 • Published 22 days ago • 14
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing Paper • 2603.00141 • Published about 1 month ago • 138
MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios Paper • 2602.22638 • Published 28 days ago • 107
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning Paper • 2602.10090 • Published Feb 10 • 51
Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published Feb 10 • 201
Thinking with Map Collection Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization • 4 items • Updated Feb 9 • 1
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper • 2601.22153 • Published Jan 29 • 74
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published Jan 29 • 102
Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models Paper • 2601.20354 • Published Jan 28 • 112
Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation Paper • 2601.20614 • Published Jan 28 • 120