Text Generation
Safetensors
English
llama
shining-valiant
shining-valiant-2
valiant
valiant-labs
llama-3.2
llama-3.2-instruct
llama-3.2-instruct-3b
llama-3
llama-3-instruct
llama-3-instruct-3b
3b
science
physics
biology
chemistry
compsci
computer-science
engineering
technical
conversational
chat
instruct
Eval Results
eval format
Browse files
README.md
CHANGED
|
@@ -202,16 +202,3 @@ We care about open source.
|
|
| 202 |
For everyone to use.
|
| 203 |
|
| 204 |
We encourage others to finetune further from our models.
|
| 205 |
-
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
| 206 |
-
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_ValiantLabs__Llama3.2-3B-ShiningValiant2)
|
| 207 |
-
|
| 208 |
-
| Metric |Value|
|
| 209 |
-
|-------------------|----:|
|
| 210 |
-
|Avg. |17.42|
|
| 211 |
-
|IFEval (0-Shot) |49.12|
|
| 212 |
-
|BBH (3-Shot) |19.03|
|
| 213 |
-
|MATH Lvl 5 (4-Shot)| 9.52|
|
| 214 |
-
|GPQA (0-shot) | 3.02|
|
| 215 |
-
|MuSR (0-shot) | 4.72|
|
| 216 |
-
|MMLU-PRO (5-shot) |19.09|
|
| 217 |
-
|
|
|
|
| 202 |
For everyone to use.
|
| 203 |
|
| 204 |
We encourage others to finetune further from our models.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|