Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • 免费去水印

  • Log In
  • Sign Up

datablations

https://github.com/huggingface/datablations
Activity Feed Request to join this org

AI & ML interests

Scaling Data-Constrained Language Models

Thomas Wolf's profile picture Teven Le Scao's profile picture Sasha Rush's profile picture Niklas Muennighoff's profile picture Aleksandra Piktus's profile picture Nouamane Tazi's profile picture Sampo Pyysalo's profile picture Colin Raffel's profile picture Risto Luukkonen's profile picture

datablations 's datasets 13

datablations/scripts

Viewer • Updated Jun 15, 2023 • 3.48M • 2.92k

datablations/oscar-subsets

Viewer • Updated Jun 14, 2023 • 365k • 1.37k

datablations/c4-subsets

Viewer • Updated Jun 14, 2023 • 729k • 1.2k • 5

datablations/c4-filter-megatron

Updated May 28, 2023 • 733

datablations/oscar-filter-megatron

Updated May 27, 2023 • 358

datablations/python-megatron

Updated May 22, 2023 • 3.79k • 1

datablations/subsets

Viewer • Updated May 10, 2023 • 365k • 92

datablations/oscar-filter

Viewer • Updated May 10, 2023 • 432M • 9k

datablations/oscar-dedup-expanded

Viewer • Updated May 10, 2023 • 432M • 7.07k

datablations/mup

Updated Apr 24, 2023 • 1.36k

datablations/c4-filter

Viewer • Updated Feb 1, 2023 • 365M • 6.16k

datablations/c4-filter-small

Viewer • Updated Jan 17, 2023 • 100k • 100

datablations/oscar-filter-small

Viewer • Updated Nov 24, 2022 • 100k • 21
Company
TOS Privacy About Careers
Website
Models Datasets 免费Z-image图片生成 免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required