OpenGVLab/InternVL-SA-1B-Caption
Viewer
•
Updated
•
8.63M
•
176
•
17
Computer Vision
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints
Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale