Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • 免费去水印

  • Log In
  • Sign Up
umarigan 's Collections
DPO Dataset
Computer Vision Datasets
Domain Spec. Datasets
Turkish Datasets
TR Models
Turkish LLM Fine-Tune Datasets

DPO Dataset

updated Mar 20, 2024

direct preference optimization related datasets

Upvote
-

  • argilla/reward-model-data-falcon

    Viewer • Updated Jun 7, 2023 • 7.4k • 54 • 1

  • jondurbin/gutenberg-dpo-v0.1

    Viewer • Updated Jan 12, 2024 • 918 • 372 • 157

  • ybisk/piqa

    Updated Jan 18, 2024 • 46.6k • 100

  • Dahoas/rm-hh-rlhf

    Viewer • Updated Dec 22, 2022 • 89.5k • 138 • 4

  • duxx/distilabel-intel-orca-dpo-pairs-tr

    Viewer • Updated Feb 5, 2024 • 3.98k • 55 • 7

  • Dahoas/rm_instruct_helpful_preferences

    Viewer • Updated Mar 1, 2023 • 90.7k • 23 • 4

  • Dahoas/1B_hh_sft_ppo_comparison

    Viewer • Updated Jan 26, 2023 • 100 • 38

  • abacusai/MetaMath_DPO_FewShot

    Viewer • Updated Feb 26, 2024 • 395k • 170 • 27

  • abacusai/HellaSwag_DPO_FewShot

    Viewer • Updated Feb 26, 2024 • 150k • 62 • 8
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets 免费Z-image图片生成 免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required