Hugging Face
Gandharv Patil
gp02-mcgill
1 follower · 2 following
gp1702
AI & ML interests
Reinforcement Learning, Stochastic Optimisation, Probabilistic Inference
Organizations
Papers
1
arxiv:2506.16507
models
1
gp02-mcgill/zephyr-7b-dpo-qlora
Updated Jan 8, 2025
datasets
3
gp02-mcgill/ultrafeedback_binarised_all_max
Viewer • Updated Jan 31, 2025 • 176k • 7
gp02-mcgill/ultrafeedback_binarised_rnd_max
Viewer • Updated Jan 31, 2025 • 60.9k • 7
gp02-mcgill/ultrafeedback_binarised_min_max
Viewer • Updated Jan 31, 2025 • 60.9k • 11