nvidia
/

OpenMath2-Llama3.1-70B-nemo

@@ -1,20 +1,24 @@
 ---
-license: llama3.1
 base_model:
 - meta-llama/Llama-3.1-70B
 datasets:
 - nvidia/OpenMathInstruct-2
 language:
 - en
 tags:
 - nvidia
 - math
 ---
 # OpenMath2-Llama3.1-70B-nemo
 [NeMo](https://github.com/NVIDIA/NeMo) checkpoint for [OpenMath2-Llama3.1-70B](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B) which is obtained by finetuning [Llama3.1-70B-Base](https://huggingface.co/meta-llama/Llama-3.1-70B) with [OpenMathInstruct-2](https://huggingface.co/datasets/nvidia/OpenMathInstruct-2).
 The model outperforms [Llama3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) on [MATH](https://github.com/hendrycks/math) by 3.9%.
@@ -22,10 +26,10 @@ The model outperforms [Llama3.1-70B-Instruct](https://huggingface.co/meta-llama/
 | Model | GSM8K | MATH | AMC 2023 | AIME 2024 | Omni-MATH |
 |:---|:---:|:---:|:---:|:---:|:---:|
 | Llama3.1-8B-Instruct | 84.5 | 51.9 | 9/40 | 2/30 | 12.7 |
-| OpenMath2-Llama3.1-8B ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B-nemo) \| [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)) | 91.7 | 67.8 | 16/40 | 3/30 | 22.0 |
 | + majority@256 | 94.1 | 76.1 | 23/40 | 3/30 | 24.6 |
 | Llama3.1-70B-Instruct | 95.8 | 67.9 | 19/40 | 6/30 | 19.0 |
-| **OpenMath2-Llama3.1-70B** ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B-nemo) \| [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B)) | 94.9 | 71.9 | 20/40 | 4/30 | 23.1 |
 | + majority@256 | 96.0 | 79.6 | 24/40 | 6/30 | 27.6 |
 The pipeline we used to produce the data and models is fully open-sourced!
@@ -65,4 +69,4 @@ If you find our work useful, please consider citing us!
 ## Terms of use
-By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)

 ---
 base_model:
 - meta-llama/Llama-3.1-70B
 datasets:
 - nvidia/OpenMathInstruct-2
 language:
 - en
+license: llama3.1
 tags:
 - nvidia
 - math
+pipeline_tag: text-generation
+library_name: nemo
 ---
 # OpenMath2-Llama3.1-70B-nemo
 [NeMo](https://github.com/NVIDIA/NeMo) checkpoint for [OpenMath2-Llama3.1-70B](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B) which is obtained by finetuning [Llama3.1-70B-Base](https://huggingface.co/meta-llama/Llama-3.1-70B) with [OpenMathInstruct-2](https://huggingface.co/datasets/nvidia/OpenMathInstruct-2).
+This model is presented in the paper [OpenCodeReasoning: Advancing Data Distillation for Competitive Coding](https://huggingface.co/papers/2504.01943).
 The model outperforms [Llama3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) on [MATH](https://github.com/hendrycks/math) by 3.9%.
 | Model | GSM8K | MATH | AMC 2023 | AIME 2024 | Omni-MATH |
 |:---|:---:|:---:|:---:|:---:|:---:|
 | Llama3.1-8B-Instruct | 84.5 | 51.9 | 9/40 | 2/30 | 12.7 |
+| OpenMath2-Llama3.1-8B ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B-nemo) | [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)) | 91.7 | 67.8 | 16/40 | 3/30 | 22.0 |
 | + majority@256 | 94.1 | 76.1 | 23/40 | 3/30 | 24.6 |
 | Llama3.1-70B-Instruct | 95.8 | 67.9 | 19/40 | 6/30 | 19.0 |
+| **OpenMath2-Llama3.1-70B** ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B-nemo) | [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B)) | 94.9 | 71.9 | 20/40 | 4/30 | 23.1 |
 | + majority@256 | 96.0 | 79.6 | 24/40 | 6/30 | 27.6 |
 The pipeline we used to produce the data and models is fully open-sourced!
 ## Terms of use
+By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)