Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • 免费去水印

  • Log In
  • Sign Up
magibu 's Collections
Pretrain Datasets
papers
Ekip karışık verileri
Fine-tuned LLMs
Turkish Language Healthcare Datasets

Pretrain Datasets

updated 2 days ago

Datasets we use for pretraining large language models

Upvote
-

  • omarkamali/wikipedia-monthly

    Viewer • Updated 10 days ago • 181M • 15.7k • 46

  • alibayram/hukuk_soru_cevap

    Viewer • Updated Nov 6, 2024 • 2.08k • 90 • 12

  • umutertugrul/turkish-hospital-medical-articles

    Viewer • Updated Oct 2, 2025 • 24.6k • 204 • 6

  • umutertugrul/turkish-medical-articles

    Viewer • Updated Oct 2, 2025 • 42.8k • 52 • 3

  • alibayram/tr-books

    Viewer • Updated 19 days ago • 3.7k • 31

  • selimfirat/bilkent-turkish-writings-dataset

    Viewer • Updated May 24, 2025 • 25.1k • 165 • 8

  • umutertugrul/turkish-academic-theses-dataset

    Viewer • Updated Aug 18, 2025 • 649k • 48 • 8

  • alibayram/onedio_haberler

    Viewer • Updated Jun 18, 2024 • 66.7k • 4 • 5

  • habanoz/news-tr-1.8M

    Viewer • Updated Oct 6, 2024 • 1.85M • 354 • 7

  • alibayram/hepsiburada_yorumlar

    Viewer • Updated Jun 18, 2024 • 2.66M • 76 • 13

  • alibayram/kitapyurdu_yorumlar

    Viewer • Updated Jun 18, 2024 • 405k • 25

  • alibayram/beyazperde_yorumlar

    Viewer • Updated Jun 18, 2024 • 192k • 20 • 5

  • BILGEM-AI/BILGE-Synthetic-Stories

    Viewer • Updated Nov 20, 2025 • 2.87M • 124 • 5
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets 免费Z-image图片生成 免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required