InterleaveThinker: Reinforcing Agentic Interleaved Generation Paper • 2606.13679 • Published 10 days ago • 79
Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation Paper • 2606.02684 • Published 20 days ago • 16
Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models Paper • 2605.21573 • Published May 20 • 111
Code-as-Room: Generating 3D Rooms from Top-Down View Images via Agentic Code Synthesis Paper • 2605.18451 • Published May 18 • 41
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published May 14 • 88
Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video Paper • 2605.15182 • Published May 14 • 39
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published May 12 • 193
EgoSim: Egocentric World Simulator for Embodied Interaction Generation Paper • 2604.01001 • Published Apr 1 • 38
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE Paper • 2602.08961 • Published Feb 9 • 5
WorldCompass: Reinforcement Learning for Long-Horizon World Models Paper • 2602.09022 • Published Feb 9 • 21
SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation Paper • 2602.02402 • Published Feb 2 • 33