Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HuggingFaceTB
's Collections
🧠 SmolLM3
SmolLM3 pretraining datasets
SmolLM3 evaluation datasets
Dolma LongAttn Graded
Reasoning datasets
SmolLM2
SmolVLM2 📺 Smallest video LM ever 🤏🏻
📚 LLM pretraining datasets
SmolVLM
🧩 SmolLM2 Intermediate Checkpoints
The Ultimate Collection of Code Classifiers
SmolVLM 256M & 500M
📐 FineMath
💻 Local SmolLMs
🪐 SmolLM
Instruct datasets
🌌 Cosmopedia
Find textbooks in FineWeb with a classifier
FineWeb clustering & synthetic generations
Other: Stanford, OpenStax, khanAcademy, wikihow...
FW generation prompts
Wikipedia Science topics
Wikipedia textbooks
SFT Experiments
Decay mixture experiments
Instruct datasets
updated
May 5
Upvote
4
HuggingFaceTB/everyday-conversations-llama3.1-2k
Viewer
•
Updated
Jan 29
•
2.38k
•
1.35k
•
111
HuggingFaceTB/Magpie-Pro-300K-Filtered-H4
Viewer
•
Updated
Aug 17, 2024
•
300k
•
87
•
5
HuggingFaceTB/OpenHermes-2.5-H4
Viewer
•
Updated
Aug 17, 2024
•
1M
•
63
•
6
HuggingFaceTB/self-oss-instruct-sc2-H4
Viewer
•
Updated
Aug 17, 2024
•
50.7k
•
39
•
3
HuggingFaceTB/instruct-data-basics-smollm-H4
Viewer
•
Updated
Jul 24
•
767
•
139
•
1
Upvote
4
Share collection
View history
Collection guide
Browse collections