Update README.md
Browse files
README.md
CHANGED
|
@@ -34,4 +34,18 @@ The model usage is documented [here](https://github.com/Multiverse4FM/Multiverse
|
|
| 34 |
|
| 35 |
# Acknowledge
|
| 36 |
|
| 37 |
-
Thanks to the amazing s1 team for their s1.1 dataset as base data, and the Qwen team for their Qwen-2.5-32B-Instruct as base model.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 34 |
|
| 35 |
# Acknowledge
|
| 36 |
|
| 37 |
+
Thanks to the amazing s1 team for their s1.1 dataset as base data, and the Qwen team for their Qwen-2.5-32B-Instruct as base model.
|
| 38 |
+
|
| 39 |
+
# Citation Information
|
| 40 |
+
|
| 41 |
+
```bibtex
|
| 42 |
+
@misc{yang2025multiverselanguagemodelssecretly,
|
| 43 |
+
title={Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation},
|
| 44 |
+
author={Xinyu Yang and Yuwei An and Hongyi Liu and Tianqi Chen and Beidi Chen},
|
| 45 |
+
year={2025},
|
| 46 |
+
eprint={2506.09991},
|
| 47 |
+
archivePrefix={arXiv},
|
| 48 |
+
primaryClass={cs.LG},
|
| 49 |
+
url={https://arxiv.org/abs/2506.09991},
|
| 50 |
+
}
|
| 51 |
+
```
|