Updated metrics
README.md
CHANGED
@@ -179,7 +179,9 @@ Okay, the user is asking if I can talk to them. First, I need to clarify that I
 
 | Benchmark | | |
 |----------------------------------|----------------|---------------------------|
-| | SmolLM3-3B | SmolLM3-3B-8da4w |
+| | SmolLM3-3B | SmolLM3-3B-8da4w |
+| **Popular aggregated benchmark** | | |
+| mmlu | 59.29 | 55.52 |
 | **Reasoning** | | |
 | hellaswag | 56.53 | 54.39 |
 | gpqa_main_zeroshot | 32.37 | 27.46 |
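
To read the updated metrics at a glance, the gap between the two columns can be computed directly. The following is a minimal sketch, not part of the commit: the scores are copied from the table rows in this diff, while the helper names `score_drop` and `summarize` are hypothetical and introduced only for illustration.

```python
# Scores copied from the table rows in this diff, as
# (SmolLM3-3B, SmolLM3-3B-8da4w) pairs.
scores = {
    "mmlu": (59.29, 55.52),
    "hellaswag": (56.53, 54.39),
    "gpqa_main_zeroshot": (32.37, 27.46),
}


def score_drop(base, quantized):
    """Return (absolute drop in points, relative drop as a percentage of base)."""
    absolute = base - quantized
    relative = 100.0 * absolute / base
    return absolute, relative


def summarize(table):
    """Map each benchmark name to its (absolute, relative) drop."""
    return {name: score_drop(base, quant) for name, (base, quant) in table.items()}


if __name__ == "__main__":
    for name, (abs_drop, rel_drop) in summarize(scores).items():
        print(f"{name:<20} -{abs_drop:.2f} points ({rel_drop:.1f}% of base)")
```

Reporting both the absolute and the relative drop is a design choice for readability: benchmarks with low base scores, such as gpqa_main_zeroshot here, can show a modest point difference that is still a large fraction of the base score.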