Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AnyModal
/
Image-Captioning-Llama-3.2-1B
like
1
Follow
AnyModal
7
Image-to-Text
Safetensors
AnyModal/flickr30k
English
AnyModal
vlm
vision
multimodal
License:
mit
Model card
Files
Files and versions
xet
Community
1
0cb15c7
Image-Captioning-Llama-3.2-1B
/
README.md
ritabratamaiti
Update README.md
0cb15c7
verified
10 months ago
preview
code
|
raw
Copy download link
history
blame
119 Bytes
---
license:
mit
datasets:
-
AnyModal/flickr30k
base_model:
-
meta-llama/Llama-3.2-1B
-
google/vit-base-patch16-224
---