HuggingFaceTB's Collections

Instruct datasets

Updated May 5

  • HuggingFaceTB/everyday-conversations-llama3.1-2k

    Updated Jan 29 • 2.38k rows • 1.35k downloads • 111 likes

  • HuggingFaceTB/Magpie-Pro-300K-Filtered-H4

    Updated Aug 17, 2024 • 300k rows • 87 downloads • 5 likes

  • HuggingFaceTB/OpenHermes-2.5-H4

    Updated Aug 17, 2024 • 1M rows • 63 downloads • 6 likes

  • HuggingFaceTB/self-oss-instruct-sc2-H4

    Updated Aug 17, 2024 • 50.7k rows • 39 downloads • 3 likes

  • HuggingFaceTB/instruct-data-basics-smollm-H4

    Updated Jul 24 • 767 rows • 139 downloads • 1 like