reward modeling
classroom
AI & ML interests
None defined yet.
Recent Activity
tengyangx authored a paper 3 days ago: Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
tengyangx authored a paper 3 days ago: Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
tengyangx authored a paper 3 days ago: Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
Team members: 5
cornfieldrm's models: 1
cornfieldrm/llama-3-1ep-lr2e_5-8K • Text Generation • 8B • Updated Apr 25, 2024 • 1