Kwai-Klear/Klear-Reasoner-8B

Suu committed on Aug 20, 2025

Commit bd25c8d · verified · 1 Parent(s): a742022

Update README.md

Files changed (1)

README.md +9 -1

@@ -117,11 +117,19 @@ YOUR_TEST_FILE="<test_data_path>"
 ```
 ### Evaluation
-When we expand the inference budget to 64K and adopt **the YaRN method with a scaling factor of 2.5**. **Evaluation is coming soon, stay tuned.**
+When we expand the inference budget to 64K and adopt **the YaRN method with a scaling factor of 2.5**.
 The evaluation data for AIME24, AIME25, and HMMT2025 are available in our GitHub repository under the **benchmarks directory**.
 For LiveCodeBench, please download the data from the official website.
+You can run the following commands to perform inference and evaluation:
+```bash
+git clone https://github.com/suu990901/KlearReasoner
+cd KlearReasoner/benchmarks
+python inference.py --model <KlearReasoner-8B_path> --n 64 --dataset_path ./benchmarks/aime24.qs.jsonl
+python judge_math.py <path_to_inference_results>
+```
 ## 🤝 Citation
 If you find this work helpful, please cite our paper:
 ```bibtex