Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 15 days ago • 42
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 21 days ago • 104
Real-time Vision Models Collection A collection of real-time detectors. • 19 items • Updated Nov 23, 2025 • 22
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11, 2025 • 372
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models 23 days ago • 104
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated 7 days ago • 114
gliner2 family Collection GLiNER2 extends the original GLiNER architecture to support multi-task information extraction with a schema-driven interface. This base model provides • 4 items • Updated Dec 4, 2025 • 19
view article Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance 29 days ago • 82
MobileCLIP Models + DataCompDR Data Collection MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. • 25 items • Updated Aug 26, 2025 • 37
view article Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face Dec 5, 2025 • 34
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 265
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 81