ViMoGen Collection The Quest for Generalizable Motion Generation: Data, Model, and Evaluation • 3 items • Updated 1 day ago
SenseNova-SI Collection Scaling Spatial Intelligence with Multimodal Foundation Models • 9 items • Updated 8 days ago • 14
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published 16 days ago • 62
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 23 days ago • 72
Scaling Spatial Intelligence with Multimodal Foundation Models Paper • 2511.13719 • Published Nov 17, 2025 • 46