Update README.md
Browse files
README.md
CHANGED
|
@@ -22,4 +22,14 @@ widget:
|
|
| 22 |
3. [Limitations](##limitations)
|
| 23 |
4. [Training](##training)
|
| 24 |
5. [License](##license)
|
| 25 |
-
6. [Citation](##citation)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
3. [Limitations](##limitations)
|
| 23 |
4. [Training](##training)
|
| 24 |
5. [License](##license)
|
| 25 |
+
6. [Citation](##citation)
|
| 26 |
+
|
| 27 |
+
## Model Summary
|
| 28 |
+
|
| 29 |
+
The JavaCoder models are !B parameter models trained on 80+ programming languages from [The Stack (v1.2)](https://huggingface.co/datasets/bigcode/the-stack), with opt-out requests excluded. The model uses [Multi Query Attention](https://arxiv.org/abs/1911.02150), [a context window of 8192 tokens](https://arxiv.org/abs/2205.14135), and was trained using the [Fill-in-the-Middle objective](https://arxiv.org/abs/2207.14255) on 1 trillion tokens.
|
| 30 |
+
|
| 31 |
+
- **Repository:**
|
| 32 |
+
- **Project Website:**
|
| 33 |
+
- **Paper:**
|
| 34 |
+
- **Point of Contact:**
|
| 35 |
+
- **Languages:** 80+ Programming languages
|