HuggingFaceM4
HuggingFaceM4 is the multimodal team at Hugging Face, working on vision-language models.
Within this organization on the Hugging Face Hub, you can access the Idefics models (version 1: IDEFICS, version 2: Idefics2, version 3: Idefics3), datasets used for training, such as OBELICS, WebSight, The Cauldron, and Docmatix, and interactive tools for visualizing the results.
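As a rough illustration of browsing this organization programmatically, here is a minimal sketch that assumes the `huggingface_hub` Python package is installed and that network access is available. The helper names `repo_id` and `list_org_models` are hypothetical and chosen only for this example. The repo name `idefics2-8b` is taken from the listing on this page.

```python
from __future__ import annotations

# Organization name as it appears in the repo IDs on this page.
ORG = "HuggingFaceM4"


def repo_id(name: str, org: str = ORG) -> str:
    """Build a Hub repo ID such as 'HuggingFaceM4/idefics2-8b' (hypothetical helper)."""
    return f"{org}/{name}"


def list_org_models(org: str = ORG, limit: int = 10) -> list[str]:
    """Return up to `limit` model repo IDs owned by `org` (hypothetical helper).

    Assumes `huggingface_hub` is installed and the Hub is reachable.
    """
    # Imported lazily so that repo_id() works without the optional dependency.
    from huggingface_hub import list_models

    return [model.id for model in list_models(author=org, limit=limit)]


if __name__ == "__main__":
    print(repo_id("idefics2-8b"))
    for model_id in list_org_models():
        print(model_id)
```

The returned IDs can be passed to whichever loading code suits the model in question. The sketch deliberately stops at listing, because loading an 8B-parameter model has hardware requirements that this page does not describe.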
- IDEFICS2 Playground: Chat with a visual AI assistant using text and images (169 likes)
- HuggingFaceM4/idefics2-8b: Image-Text-to-Text, 8B parameters, 158k downloads, 620 likes
- HuggingFaceM4/idefics2-8b-chatty: Image-Text-to-Text, 8B parameters, 72 downloads, 95 likes
- HuggingFaceM4/idefics2-8b-base: Image-Text-to-Text, 8B parameters, 1.62k downloads, 28 likes
Spaces (20)
IDEFICS Playground
faster-qwen3-tts
Generate speech audio from text with custom or cloned voices
Reachy Mini Remote Control (Multi-User)
Remote control for Reachy Mini robots with authentication
Reachy Mini Key Claim
Request an ephemeral API key using an order number
Gradium Setup
A small Space to improve onboarding to Gradium
FineVision: Open Data is All You Need
A new open-source dataset for training VLMs