Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Fireworks
Cerebras
Nebius AI
Novita
Together AI
Groq
fal
Hyperbolic
+ 6
Apply filters
Models
6,078
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
OpenGVLab/InternVL3-14B
Image-Text-to-Text
•
15B
•
Updated
May 29
•
1.07M
•
73
OpenGVLab/InternVL3-2B
Image-Text-to-Text
•
2B
•
Updated
May 29
•
50.3k
•
36
Skywork/Skywork-VL-Reward-7B
Image-Text-to-Text
•
8B
•
Updated
Jun 10
•
483
•
45
google/gemma-3-27b-it-qat-q4_0-unquantized
Image-Text-to-Text
•
27B
•
Updated
Apr 15
•
7.98k
•
36
OpenGVLab/InternVL3-14B-AWQ
Image-Text-to-Text
•
Updated
May 29
•
25.4k
•
7
lmstudio-community/gemma-3-4B-it-qat-GGUF
Image-Text-to-Text
•
4B
•
Updated
Apr 18
•
4.58k
•
18
OpenGVLab/InternVL3-8B-hf
Image-Text-to-Text
•
8B
•
Updated
Apr 23
•
22.7k
•
9
tonyli8623/Hicoder-R1-Distill-Gemma-27B
Image-Text-to-Text
•
27B
•
Updated
Apr 21
•
12
•
4
unsloth/gemma-3-4b-it-qat-GGUF
Image-Text-to-Text
•
4B
•
Updated
Jun 15
•
10.4k
•
18
meta-llama/Llama-Guard-4-12B
Image-Text-to-Text
•
12B
•
Updated
Apr 29
•
33.2k
•
•
53
unsloth/gemma-3-27b-it-qat-GGUF
Image-Text-to-Text
•
27B
•
Updated
May 9
•
9.44k
•
15
leon-se/gemma-3-27b-it-qat-W4A16-G128
Image-Text-to-Text
•
7B
•
Updated
Apr 27
•
17.3k
•
11
xlangai/Jedi-7B-1080p
Image-Text-to-Text
•
8B
•
Updated
Jun 18
•
2.51k
•
28
xlangai/Jedi-3B-1080p
Image-Text-to-Text
•
4B
•
Updated
Jun 18
•
1.63k
•
15
lusxvr/nanoVLM-222M
Image-Text-to-Text
•
0.2B
•
Updated
May 8
•
511
•
95
osunlp/WebJudge-7B
Image-Text-to-Text
•
8B
•
Updated
May 12
•
107
•
6
MathLLMs/MathCoder-VL-2B
Image-Text-to-Text
•
2B
•
Updated
May 28
•
24
•
4
Mungert/UI-TARS-1.5-7B-GGUF
Image-Text-to-Text
•
8B
•
Updated
15 days ago
•
4.14k
•
9
unsloth/InternVL3-38B-GGUF
Image-Text-to-Text
•
33B
•
Updated
May 18
•
402
•
3
google/medgemma-4b-pt
Image-Text-to-Text
•
4B
•
Updated
May 21
•
9.09k
•
110
ChenShawn/DeepEyes-7B
Image-Text-to-Text
•
8B
•
Updated
May 22
•
5.17k
•
11
unsloth/medgemma-4b-it
Image-Text-to-Text
•
5B
•
Updated
Jul 15
•
2.28k
•
2
unsloth/medgemma-4b-it-GGUF
Image-Text-to-Text
•
4B
•
Updated
Jul 15
•
15.7k
•
41
unsloth/medgemma-27b-text-it-unsloth-bnb-4bit
Image-Text-to-Text
•
15B
•
Updated
May 20
•
3.49k
•
3
WaltonFuture/Qwen2.5-VL-7B-MM-UPT-MMR1
Image-Text-to-Text
•
8B
•
Updated
Jun 4
•
10
•
3
numind/NuExtract-2.0-4B
Image-Text-to-Text
•
4B
•
Updated
16 days ago
•
2.37k
•
18
GSAI-ML/LLaDA-V
Image-Text-to-Text
•
8B
•
Updated
Jun 18
•
8.38k
•
17
XiaomiMiMo/MiMo-VL-7B-RL
Image-Text-to-Text
•
8B
•
Updated
Jun 7
•
14.1k
•
157
bartowski/xlangai_Jedi-7B-1080p-GGUF
Image-Text-to-Text
•
8B
•
Updated
Jun 1
•
90
•
3
lmstudio-community/Jedi-7B-1080p-GGUF
Image-Text-to-Text
•
8B
•
Updated
Jun 1
•
47
•
2
Previous
1
...
6
7
8
9
10
...
100
Next