Update README.md
Browse files
README.md
CHANGED
|
@@ -99,13 +99,10 @@ tokenizer.apply_chat_template(question, add_generation_prompt=True)
|
|
| 99 |
If you find this work useful in your research, please consider citing:
|
| 100 |
|
| 101 |
```bibtex
|
| 102 |
-
@
|
| 103 |
-
|
| 104 |
-
|
| 105 |
-
|
| 106 |
-
|
| 107 |
-
archivePrefix={arXiv},
|
| 108 |
-
primaryClass={cs.CL},
|
| 109 |
-
url={https://arxiv.org/abs/2502.06781},
|
| 110 |
}
|
| 111 |
```
|
|
|
|
| 99 |
If you find this work useful in your research, please consider citing:
|
| 100 |
|
| 101 |
```bibtex
|
| 102 |
+
@article{lyu2025exploring,
|
| 103 |
+
title={Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning},
|
| 104 |
+
author={Lyu, Chengqi and Gao, Songyang and Gu, Yuzhe and Zhang, Wenwei and Gao, Jianfei and Liu, Kuikun and Wang, Ziyi and Li, Shuaibin and Zhao, Qian and Huang, Haian and others},
|
| 105 |
+
journal={arXiv preprint arXiv:2502.06781},
|
| 106 |
+
year={2025}
|
|
|
|
|
|
|
|
|
|
| 107 |
}
|
| 108 |
```
|