HuggingFaceFW/fineweb • 52.5B • 290k • 2.33k
We release large pre-training datasets to accelerate open LLM development. Part of the Hugging Face Science team (hf.co/science).