Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ndkhanh95
/
Paligemma
like
1
Image-Text-to-Text
Transformers
Safetensors
paligemma
image-to-text
text-generation-inference
arxiv:
20 papers
License:
gemma
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
Paligemma
/
big_vision_repo
/
big_vision
/
datasets
235 kB
1 contributor
History:
1 commit
ndkhanh95
Upload 304 files
fa1a600
verified
10 months ago
ai2d
Upload 304 files
10 months ago
aokvqa
Upload 304 files
10 months ago
chartqa
Upload 304 files
10 months ago
coco35l
Upload 304 files
10 months ago
countbenchqa
Upload 304 files
10 months ago
docvqa
Upload 304 files
10 months ago
gqa
Upload 304 files
10 months ago
imagenet
Upload 304 files
10 months ago
infovqa
Upload 304 files
10 months ago
nocaps
Upload 304 files
10 months ago
okvqa
Upload 304 files
10 months ago
pope
Upload 304 files
10 months ago
refcoco
Upload 304 files
10 months ago
rsvqa_hr
Upload 304 files
10 months ago
rsvqa_lr
Upload 304 files
10 months ago
scicap
Upload 304 files
10 months ago
science_qa
Upload 304 files
10 months ago
screen2words
Upload 304 files
10 months ago
stvqa
Upload 304 files
10 months ago
tallyqa
Upload 304 files
10 months ago
textcaps
Upload 304 files
10 months ago
textvqa
Upload 304 files
10 months ago
vizwizvqa
Upload 304 files
10 months ago
vqa
Upload 304 files
10 months ago
widgetcap
Upload 304 files
10 months ago
xgqa
Upload 304 files
10 months ago
xm3600
Upload 304 files
10 months ago
core.py
Safe
3.01 kB
Upload 304 files
10 months ago
jsonl.py
Safe
6.69 kB
Upload 304 files
10 months ago
sequence_packing.py
Safe
7.95 kB
Upload 304 files
10 months ago
tfds.py
Safe
3.46 kB
Upload 304 files
10 months ago