Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
免费去水印
Log In
Sign Up
trl-lib
's Collections
Preference datasets
Stepwise supervision datasets
Prompt-completion datasets
Prompt-only datasets
Unpaired preference datasets
Comparing DPO with IPO and KTO
Online-DPO
Unpaired preference datasets
updated
Jan 8, 2025
Upvote
1
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Jan 8, 2025
•
16.6k
•
66
•
4
trl-lib/kto-mix-14k
Viewer
•
Updated
Mar 25, 2024
•
15k
•
478
•
9
Upvote
1
Share collection
View history
Collection guide
Browse collections
×
🎉 Free Image Generator Now Available!
Totally Free + Zero Barriers + No Login Required
Visit Now