Update README.md
README.md
CHANGED
@@ -19,8 +19,8 @@ pipeline_tag: visual-question-answering
 - **Model Stats:**
 - Architecture: [InternViT-6B-448px-V1-5](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-2) + MLP + [InternLM2-Chat-20B](https://huggingface.co/internlm/internlm2-chat-20b)
 - Params: 25.5B
-- Image size: dynamic resolution, max to
-- Number of visual tokens: 256 * number of tiles
+- Image size: dynamic resolution, max to 40 tiles of 448 x 448 during inference.
+- Number of visual tokens: 256 * (number of tiles + 1)
 
 - **Training Strategy:**
 - Pretraining Stage
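
The updated token formula can be checked with a little arithmetic. The sketch below is illustrative only: `visual_token_count`, `TOKENS_PER_TILE`, and `MAX_TILES` are hypothetical names, not part of the model's code, and the meaning of the extra `+ 1` (possibly a global thumbnail tile) is an assumption this diff does not confirm.

```python
# Constants taken from the changed README lines; the names are hypothetical.
TOKENS_PER_TILE = 256  # visual tokens contributed by each 448 x 448 tile
MAX_TILES = 40         # stated upper bound on tiles during inference


def visual_token_count(num_tiles: int) -> int:
    """Return 256 * (num_tiles + 1), the formula in the updated README.

    The "+ 1" is assumed here to be one extra tile's worth of tokens;
    the diff itself does not say what it represents.
    """
    if not 1 <= num_tiles <= MAX_TILES:
        raise ValueError(f"num_tiles must be in [1, {MAX_TILES}], got {num_tiles}")
    return TOKENS_PER_TILE * (num_tiles + 1)


if __name__ == "__main__":
    for tiles in (1, 12, MAX_TILES):
        print(tiles, "tiles ->", visual_token_count(tiles), "visual tokens")
```

Under the old wording (256 * number of tiles), 40 tiles would have given 10240 tokens; the corrected formula gives 10496 for the same input.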