Papers
AI & ML interests
R3 Model is all you need
Recent Activity
View all activity
models
66

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-14B-LoRA-4k
Text Generation
•
Updated
•
14

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-8B-14k
Text Generation
•
Updated
•
13

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-4B-14k
Text Generation
•
Updated
•
13

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-4k
15B
•
Updated
•
7

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-14k
15B
•
Updated
•
10

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-14k
Text Generation
•
15B
•
Updated
•
10

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-4k
Text Generation
•
15B
•
Updated
•
11

rubricreward/R3-Phi-4-reasoning-plus-LoRA-14k
15B
•
Updated
•
13

rubricreward/R3-Qwen3-14B-LoRA-14k
15B
•
Updated
•
13

rubricreward/R3-Qwen3-8B-LoRA-14k
Text Generation
•
8B
•
Updated
•
8
•
2
datasets
83
rubricreward/PolyGuardMix
Viewer
•
Updated
•
2.99M
•
6
rubricreward/arena-human-preference
Viewer
•
Updated
•
120k
rubricreward/R3-eval-XSUM-new
Viewer
•
Updated
•
5.36k
•
163
rubricreward/R3-eval-MMLU-STEM
Viewer
•
Updated
•
6.31k
•
170
rubricreward/R3-eval-BBH
Viewer
•
Updated
•
13.5k
•
156
rubricreward/R3-eval-RM-Bench-new
Viewer
•
Updated
•
11.9k
•
144
rubricreward/R3-eval-reward-bench-new
Viewer
•
Updated
•
2.99k
•
184
rubricreward/R3-Dataset-20K-NoRubric-Filter2
Viewer
•
Updated
•
2.41k
•
70
rubricreward/R3-Dataset-20K-NoRubric-Filter1
Viewer
•
Updated
•
10.3k
•
79
rubricreward/R3-Dataset-20K-NoRubric
Viewer
•
Updated
•
20k
•
99