MVTrack4Gen: Multi-View Point Tracking as Geometric Supervision for 4D Video Generation Paper • 2606.26087 • Published 5 days ago • 35
Rethinking RAG in Long Videos: What to Retrieve and How to Use It? Paper • 2606.13141 • Published 18 days ago • 36
TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration Paper • 2606.04743 • Published 26 days ago • 47
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts Paper • 2606.02404 • Published 28 days ago • 59
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Paper • 2605.29250 • Published May 28 • 78
Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents Paper • 2605.28775 • Published May 27 • 38
Self-Improving Language Models with Bidirectional Evolutionary Search Paper • 2605.28814 • Published May 27 • 61
ResearchMath-14K: Scaling Research-Level Mathematics via Agents Paper • 2605.28003 • Published May 27 • 50
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published May 27 • 93
It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs Paper • 2605.20258 • Published May 18 • 30
Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR Paper • 2605.15726 • Published May 15 • 35
Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding Paper • 2605.02290 • Published May 4 • 42
TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking Paper • 2605.12587 • Published May 12 • 37
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published May 9 • 82
ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack Paper • 2509.25843 • Published Apr 14 • 20
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated Apr 22 • 45
mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT Paper • 2603.21606 • Published Mar 23 • 39
BenchPreS: A Benchmark for Context-Aware Personalized Preference Selectivity of Persistent-Memory LLMs Paper • 2603.16557 • Published Mar 17 • 22