Update README.md
Browse files
README.md
CHANGED
@@ -15,8 +15,10 @@ Paper abstract:
|
|
15 |
- **Repository:** https://github.com/abao1999/panda
|
16 |
|
17 |
<!-- This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1). -->
|
|
|
18 |
|
19 |
NOTE: we are currently in the process of scaling up our model and training, so stay tuned!
|
|
|
20 |
|
21 |
## Citation
|
22 |
|
|
|
15 |
- **Repository:** https://github.com/abao1999/panda
|
16 |
|
17 |
<!-- This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1). -->
|
18 |
+
This checkpoint was trained for (only) 100k iterations, with per-device batch size 1024, across 4 AMD MI100X GPUs.
|
19 |
|
20 |
NOTE: we are currently in the process of scaling up our model and training, so stay tuned!
|
21 |
+
Update: We have released a bigger model: [panda-72M](https://huggingface.co/GilpinLab/panda-72M)
|
22 |
|
23 |
## Citation
|
24 |
|