Upload folder using huggingface_hub
Browse files
README.md
CHANGED
@@ -40,7 +40,7 @@ We evaluate our model on two challenging reward benchmarks, [RM-Bench](https://g
|
|
40 |
|**Open-Source Reward Models**||||||
|
41 |
|Llama-3.1-Nemotron-70B-Reward | 70B | 70.7 | 64.3 | 57.4 | 90.3 | 70.7|
|
42 |
|Skywork-Reward-Gemma-2-27B | 27B | 71.8 | 59.2 | 56.6 | 94.3 | 70.5|
|
43 |
-
|Skywork-Reward-Llama-3.1-8B |
|
44 |
|Nemotron-Super | 49B | 73.7 | 91.4 | 75.0 | 90.6 | 82.7 |
|
45 |
|Nemotron-Super-Multilingual | 49B | **77.2** | **91.9** | 74.7 | 92.9 | 84.2|
|
46 |
|**Reasoning Reward Models**||||||
|
@@ -68,7 +68,7 @@ We evaluate our model on two challenging reward benchmarks, [RM-Bench](https://g
|
|
68 |
|**Open-Source Reward Models**||||||
|
69 |
|Llama-3.1-Nemotron-70B-Reward | 70B | 62.3 | 72.5 | 76.8 | 57.1 | 67.2|
|
70 |
|Skywork-Reward-Gemma-2-27B | 27B | 59.7 | 66.3 | 83.9 | 50.0 | 65.0|
|
71 |
-
|Skywork-Reward-Llama-3.1-8B |
|
72 |
|Nemotron-Super | 49B | 71.4 | 73.5 | 87.5 | 76.2 | 77.2 |
|
73 |
|Nemotron-Super-Multilingual | 49B | 64.9 | 74.5 | 87.5 | 73.8 | 75.2|
|
74 |
|**Reasoning Reward Models**||||||
|
|
|
40 |
|**Open-Source Reward Models**||||||
|
41 |
|Llama-3.1-Nemotron-70B-Reward | 70B | 70.7 | 64.3 | 57.4 | 90.3 | 70.7|
|
42 |
|Skywork-Reward-Gemma-2-27B | 27B | 71.8 | 59.2 | 56.6 | 94.3 | 70.5|
|
43 |
+
|Skywork-Reward-Llama-3.1-8B | 8B | 69.5 | 60.6 | 54.5 | 95.7 | 70.1|
|
44 |
|Nemotron-Super | 49B | 73.7 | 91.4 | 75.0 | 90.6 | 82.7 |
|
45 |
|Nemotron-Super-Multilingual | 49B | **77.2** | **91.9** | 74.7 | 92.9 | 84.2|
|
46 |
|**Reasoning Reward Models**||||||
|
|
|
68 |
|**Open-Source Reward Models**||||||
|
69 |
|Llama-3.1-Nemotron-70B-Reward | 70B | 62.3 | 72.5 | 76.8 | 57.1 | 67.2|
|
70 |
|Skywork-Reward-Gemma-2-27B | 27B | 59.7 | 66.3 | 83.9 | 50.0 | 65.0|
|
71 |
+
|Skywork-Reward-Llama-3.1-8B | 8B | 59.1 | 64.3 | 76.8 | 50.0 | 62.5|
|
72 |
|Nemotron-Super | 49B | 71.4 | 73.5 | 87.5 | 76.2 | 77.2 |
|
73 |
|Nemotron-Super-Multilingual | 49B | 64.9 | 74.5 | 87.5 | 73.8 | 75.2|
|
74 |
|**Reasoning Reward Models**||||||
|