view article Article Announcing the Synthetic Online Conversations Dataset (SOC) By marcodsn • 12 days ago • 11
MolmoAct Data Mixture Collection All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 3 items • Updated 10 days ago • 11
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • 17 days ago • 69
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 20 days ago • 473
view article Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 21 days ago • 27
view article Article Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • 24 days ago • 63
view changelog Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face 26 days ago • 124
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 27 days ago • 159
Dayhoff Atlas Collection The models and datasets that comprise the Dayhoff Atlas • 10 items • Updated 28 days ago • 8
view article Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ By Wauplin and 2 others • Jul 25 • 79
view article Article TimeScope: How Long Can Your Video Large Multimodal Model Go? By orrzohar and 3 others • Jul 23 • 39
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 655
SmolLM3 evaluation datasets Collection Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry • 13 items • Updated Jul 8 • 5
SmolLM3 pretraining datasets Collection datasets used in SmolLM3 pretraining • 15 items • Updated 12 days ago • 28
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 637