Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
wangclnlp
/
GRAM-RR-LLaMA-3.1-8B-RewardModel
like
0
Text Generation
Safetensors
English
llama
Reward
RewardModel
RewardReasoning
Reasoning
RLHF
Best-of-N
conversational
arxiv:
2509.02492
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
0060ca9
GRAM-RR-LLaMA-3.1-8B-RewardModel
/
README.md
wangclnlp
Upload folder using huggingface_hub
0060ca9
verified
29 days ago
preview
code
|
raw
Copy download link
history
blame
Safe
0 Bytes