HuggingFaceM4
AI & ML interests
None defined yet.
Recent Activity
HuggingFaceM4 is the multimodal team at Hugging Face, working on vision-language models.
Within this organization on the Hugging Face hub, you can access the Idefics models (version 1 IDEFICS, version 2 Idefics2, version 3 Idefics3), datasets used for the training like OBELICS, WebSight, The Cauldron or Docmatix, and interactive tools to visualize the results.
-
IDEFICS2 Playground
🐨169Chat with a visual AI that answers questions about images
-
HuggingFaceM4/idefics2-8b
Image-Text-to-Text • 8B • Updated • 136k • 624 -
HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text • 8B • Updated • 149 • 95 -
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text • 8B • Updated • 1.11k • 28
-
IDEFICS2 Playground
🐨169Chat with a visual AI that answers questions about images
-
HuggingFaceM4/idefics2-8b
Image-Text-to-Text • 8B • Updated • 136k • 624 -
HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text • 8B • Updated • 149 • 95 -
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text • 8B • Updated • 1.11k • 28
spaces 22
IDEFICS Playground
Demo of Encoder-Free VLM Trained for $100
Ask questions about images and get answers instantly
Encoder-Free VLM
Train Your Own Encoder-Free VLM in $100
faster-qwen3-tts
Generate natural speech from text or voice samples
Reachy Mini Remote Control (Multi-User)
Remote control for Reachy Mini robots with authentication
Reachy Mini Key Claim
Request an ephemeral API key using an order number