Update README.md
Browse files
README.md
CHANGED
|
@@ -33,11 +33,11 @@ As of 18 Mar 2025, augmenting models with the Feedback-Edit Inference Time Scali
|
|
| 33 |
|
| 34 |
| Model | Arena Hard (95% CI) |
|
| 35 |
|:-----------------------------|:----------------|
|
| 36 |
-
| Llama
|
| 37 |
| Llama-3.1-Nemotron-70B-Instruct + **Feedback-Edit ITS** | 92.7 (-1.2, 0.9) |
|
| 38 |
| o1-mini-2024-09-12 | 92.0 (-1.2, 1.0) |
|
| 39 |
| o1-preview-2024-09-12 | 90.4 (-1.1, 1.3) |
|
| 40 |
-
| Llama
|
| 41 |
| claude-3-5-sonnet-20241022 | 85.2 (-1.4, 1.6) |
|
| 42 |
| Llama-3.1-Nemotron-70B-Instruct | 84.9 (-1.7, 1.8) |
|
| 43 |
|
|
|
|
| 33 |
|
| 34 |
| Model | Arena Hard (95% CI) |
|
| 35 |
|:-----------------------------|:----------------|
|
| 36 |
+
| Llama-3.3-Nemotron-49B-Instruct + **Feedback-Edit ITS** | **93.4 (-1.1, 1.0)** |
|
| 37 |
| Llama-3.1-Nemotron-70B-Instruct + **Feedback-Edit ITS** | 92.7 (-1.2, 0.9) |
|
| 38 |
| o1-mini-2024-09-12 | 92.0 (-1.2, 1.0) |
|
| 39 |
| o1-preview-2024-09-12 | 90.4 (-1.1, 1.3) |
|
| 40 |
+
| Llama-3.3-Nemotron-49B-Instruct | 88.3 (-1.6, 1.6) |
|
| 41 |
| claude-3-5-sonnet-20241022 | 85.2 (-1.4, 1.6) |
|
| 42 |
| Llama-3.1-Nemotron-70B-Instruct | 84.9 (-1.7, 1.8) |
|
| 43 |
|