Add citations
README.md CHANGED

@@ -286,3 +286,20 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
|MuSR (0-shot)     | 4.79|
|MMLU-PRO (5-shot) | 1.53|
## Citation

If you use MicroLlama in your research or work, please cite the project using the following reference:

APA:

```
Wang, Z. K. (2024). MicroLlama: A 300M-parameter language model trained from scratch. GitHub & Hugging Face. https://github.com/keeeeenw/MicroLlama, https://huggingface.co/keeeeenw/MicroLlama
```

BibTeX:

```
@misc{wang2024microllama,
  author       = {Zixiao Ken Wang},
  title        = {MicroLlama: A 300M-parameter language model trained from scratch},
  year         = {2024},
  howpublished = {\url{https://github.com/keeeeenw/MicroLlama}, \url{https://huggingface.co/keeeeenw/MicroLlama}},
  note         = {GitHub and Hugging Face repositories}
}
```

🙏 Please cite this work if you find it useful.