wangclnlp commited on
Commit
9bdbf53
·
verified ·
1 Parent(s): 68da516

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -40,7 +40,7 @@ We evaluate our model on two challenging reward benchmarks, [RM-Bench](https://g
40
  |**Open-Source Reward Models**||||||
41
  |Llama-3.1-Nemotron-70B-Reward | 70B | 70.7 | 64.3 | 57.4 | 90.3 | 70.7|
42
  |Skywork-Reward-Gemma-2-27B | 27B | 71.8 | 59.2 | 56.6 | 94.3 | 70.5|
43
- |Skywork-Reward-Llama-3.1-8B | 27B | 69.5 | 60.6 | 54.5 | 95.7 | 70.1|
44
  |Nemotron-Super | 49B | 73.7 | 91.4 | 75.0 | 90.6 | 82.7 |
45
  |Nemotron-Super-Multilingual | 49B | **77.2** | **91.9** | 74.7 | 92.9 | 84.2|
46
  |**Reasoning Reward Models**||||||
@@ -68,7 +68,7 @@ We evaluate our model on two challenging reward benchmarks, [RM-Bench](https://g
68
  |**Open-Source Reward Models**||||||
69
  |Llama-3.1-Nemotron-70B-Reward | 70B | 62.3 | 72.5 | 76.8 | 57.1 | 67.2|
70
  |Skywork-Reward-Gemma-2-27B | 27B | 59.7 | 66.3 | 83.9 | 50.0 | 65.0|
71
- |Skywork-Reward-Llama-3.1-8B | 27B | 59.1 | 64.3 | 76.8 | 50.0 | 62.5|
72
  |Nemotron-Super | 49B | 71.4 | 73.5 | 87.5 | 76.2 | 77.2 |
73
  |Nemotron-Super-Multilingual | 49B | 64.9 | 74.5 | 87.5 | 73.8 | 75.2|
74
  |**Reasoning Reward Models**||||||
 
40
  |**Open-Source Reward Models**||||||
41
  |Llama-3.1-Nemotron-70B-Reward | 70B | 70.7 | 64.3 | 57.4 | 90.3 | 70.7|
42
  |Skywork-Reward-Gemma-2-27B | 27B | 71.8 | 59.2 | 56.6 | 94.3 | 70.5|
43
+ |Skywork-Reward-Llama-3.1-8B | 8B | 69.5 | 60.6 | 54.5 | 95.7 | 70.1|
44
  |Nemotron-Super | 49B | 73.7 | 91.4 | 75.0 | 90.6 | 82.7 |
45
  |Nemotron-Super-Multilingual | 49B | **77.2** | **91.9** | 74.7 | 92.9 | 84.2|
46
  |**Reasoning Reward Models**||||||
 
68
  |**Open-Source Reward Models**||||||
69
  |Llama-3.1-Nemotron-70B-Reward | 70B | 62.3 | 72.5 | 76.8 | 57.1 | 67.2|
70
  |Skywork-Reward-Gemma-2-27B | 27B | 59.7 | 66.3 | 83.9 | 50.0 | 65.0|
71
+ |Skywork-Reward-Llama-3.1-8B | 8B | 59.1 | 64.3 | 76.8 | 50.0 | 62.5|
72
  |Nemotron-Super | 49B | 71.4 | 73.5 | 87.5 | 76.2 | 77.2 |
73
  |Nemotron-Super-Multilingual | 49B | 64.9 | 74.5 | 87.5 | 73.8 | 75.2|
74
  |**Reasoning Reward Models**||||||