Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding Paper • 2512.17532 • Published 7 days ago • 62
Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs Paper • 2512.17008 • Published 8 days ago • 10
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers Paper • 2512.17351 • Published 8 days ago • 20
An Anatomy of Vision-Language-Action Models: From Modules to Milestones and Challenges Paper • 2512.11362 • Published 15 days ago • 20
PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence Paper • 2512.16793 • Published 8 days ago • 71
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows Paper • 2512.16969 • Published 8 days ago • 105
Hybrid Attribution Priors for Explainable and Robust Model Training Paper • 2512.14719 • Published 18 days ago • 2
In Pursuit of Pixel Supervision for Visual Pre-training Paper • 2512.15715 • Published 9 days ago • 8
VOYAGER: A Training Free Approach for Generating Diverse Datasets using LLMs Paper • 2512.12072 • Published 14 days ago • 17
Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets Paper • 2512.15110 • Published 10 days ago • 7
VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression? Paper • 2512.15649 • Published 9 days ago • 6
SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning Paper • 2512.13874 • Published 11 days ago • 16
DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models Paper • 2512.15713 • Published 9 days ago • 15
Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning Paper • 2512.15693 • Published 9 days ago • 16
HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices Paper • 2512.14052 • Published 11 days ago • 39
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing Paper • 2512.14681 • Published 10 days ago • 39
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published 10 days ago • 41
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics Paper • 2512.12602 • Published 13 days ago • 39