| license: apache-2.0 | |
| datasets: | |
| - LinkSoul/Chinese-LLaVA-Vision-Instructions | |
| language: | |
| - en | |
| - zh | |
| tags: | |
| - llava | |
| - vlm | |
| The bilingual English/Chinese Llama2-7B-Chat VLM trained via LORA for https://arxiv.org/abs/2406.11665. | |
| The Chinese half of the training data used for multimodal alignment and visual instruction tuning is sampled from [here](https://huggingface.co/datasets/LinkSoul/Chinese-LLaVA-Vision-Instructions). |