nvidia
/

AceMath-7B-Instruct

Text Generation

Model card Files Files and versions

AceMath-7B-Instruct / evaluation /README.md

zihanliu's picture

Upload 3 files

5689fb0 verified 10 months ago

|

336 Bytes




	## Introduction
	This is the evaluation script used to reproduce math benchmarks scores for AceMath-1.5B/7B/72B-Instruct models based on their outputs. The benchmark can be downloaded from [Qwen2.5-Math](https://github.com/QwenLM/Qwen2.5-Math/tree/main/evaluation/data).

	## Calculate Scores
	```console
	python calculate_scores.py
	```