CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition Paper • 2509.19768 • Published Sep 24 • 5
FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition Paper • 2512.13884 • Published 9 days ago • 14
fiNERweb Collection A multilingual dataset for NER covering 91 languages and 25 scripts • 3 items • Updated 9 days ago • 1
Datasets Wrapped 2025: Reasoning Collection The reasoning datasets that defined 2025. Part 1 of Datasets Wrapped 2025. #DatasetsWrapped2025 • 20 items • Updated 9 days ago • 1
NeMo Gym Collection Collection of RL verifiable data for NeMo Gym • 13 items • Updated 1 day ago • 30
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated 1 day ago • 47
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated 1 day ago • 101
Article Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models 10 days ago • 97
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 1 day ago • 36
Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated about 16 hours ago • 25
Article Transformers v5: Simple model definitions powering the AI ecosystem 24 days ago • 253