Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
SoraWatermarkRemover
Log In
Sign Up
zhibinlan
/
LLaVE-2B
like
45
Image-Text-to-Text
Transformers
Safetensors
English
qwen2
text-generation
Sentence Similarity
Embedding
zero-shot-image-classification
video-text-to-text
conversational
text-generation-inference
arXiv:
2503.04812
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
3
Train
Deploy
Use this model
main
LLaVE-2B
/
figures
680 kB
3 contributors
History:
1 commit
zhibinlan
Upload 3 files
b616de3
verified
8 months ago
leaderboard.png
220 kB
xet
Upload 3 files
8 months ago
results.png
Safe
335 kB
xet
Upload 3 files
8 months ago
zero-shot-vr.png
Safe
124 kB
xet
Upload 3 files
8 months ago