Update README.md
Browse files
README.md
CHANGED
|
@@ -95,6 +95,18 @@ Gradient is accelerating AI transformation across industries. Our AI Foundry inc
|
|
| 95 |
|
| 96 |
Drop an email to [[email protected]](mailto:[email protected])
|
| 97 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 98 |
## References
|
| 99 |
|
| 100 |
[1] Peng, Bowen, et al. "Yarn: Efficient context window extension of large language models." arXiv preprint arXiv:2309.00071 (2023).
|
|
|
|
| 95 |
|
| 96 |
Drop an email to [[email protected]](mailto:[email protected])
|
| 97 |
|
| 98 |
+
|
| 99 |
+
## Citation
|
| 100 |
+
|
| 101 |
+
```bibtex
|
| 102 |
+
@misc{gradientlongcontextllama3,
|
| 103 |
+
title={Llama 3 Gradient: A series of long context models},
|
| 104 |
+
author={Leonid Pekelis and Michael Feil and Forrest Moret and Mark Huang and Tiffany Peng},
|
| 105 |
+
year={2024},
|
| 106 |
+
url = {https://gradient.ai/blog/scaling-rotational-embeddings-for-long-context-language-models}
|
| 107 |
+
}
|
| 108 |
+
```
|
| 109 |
+
|
| 110 |
## References
|
| 111 |
|
| 112 |
[1] Peng, Bowen, et al. "Yarn: Efficient context window extension of large language models." arXiv preprint arXiv:2309.00071 (2023).
|