HuggingFaceM4

company

Recent Activity

ndrugov updated a Space 3 days ago

HuggingFaceM4/encoder-free-vlm-demo

ndrugov updated a Space 3 days ago

HuggingFaceM4/encoder-free-vlm

ndrugov published a Space 3 days ago

HuggingFaceM4/encoder-free-vlm-demo

Organization Card

HuggingFaceM4 is the multimodal team at Hugging Face, working on vision-language models.

Within this organization on the Hugging Face Hub, you can access the Idefics models (version 1: IDEFICS, version 2: Idefics2, version 3: Idefics3), datasets used for training, such as OBELICS, WebSight, The Cauldron, and Docmatix, and interactive tools for visualizing the results.

Collections 5

Spaces 22

IDEFICS Playground

Demo of Encoder-Free VLM Trained for $100

Ask questions about images and get answers instantly

Encoder-Free VLM

Train Your Own Encoder-Free VLM in $100

faster-qwen3-tts

Generate natural speech from text or voice samples

Reachy Mini Remote Control (Multi-User)

Remote control for Reachy Mini robots with authentication

Reachy Mini Key Claim

Request an ephemeral API key using an order number

Models 34

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • 8B • Updated Dec 2, 2024 • 428k • 304

HuggingFaceM4/Florence-2-DocVQA

Image-Text-to-Text • 0.8B • Updated Oct 30, 2024 • 686 • 65

HuggingFaceM4/idefics2-8b

Image-Text-to-Text • 8B • Updated Oct 14, 2024 • 136k • 624

HuggingFaceM4/idefics2-8b-base

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 1.11k • 28

HuggingFaceM4/idefics2-8b-chatty

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 149 • 95

HuggingFaceM4/siglip-so400m-14-364-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated Jul 27, 2024 • 10 • 1

HuggingFaceM4/siglip-so400m-14-700-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated Jun 13, 2024 • 8 • 2

HuggingFaceM4/siglip-so400m-14-384-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated May 9, 2024 • 8 • 1

HuggingFaceM4/idefics2-8b-chatty-AWQ

Image-Text-to-Text • 8B • Updated May 6, 2024 • 9 • 5

HuggingFaceM4/idefics2-8b-AWQ

Image-Text-to-Text • 8B • Updated May 6, 2024 • 13 • 26

Datasets 82

HuggingFaceM4/FineVisionMax

Viewer • Updated Oct 21, 2025 • 24.2M • 26.8k • 27

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 132k • 494

HuggingFaceM4/lmms-eval-embeddings

Updated Sep 3, 2025 • 224 • 1

HuggingFaceM4/DoclingMatix

Viewer • Updated Jul 31, 2025 • 1.27M • 2.19k • 52

HuggingFaceM4/Caltech-101

Updated Sep 10, 2024 • 182 • 4

HuggingFaceM4/Docmatix

Viewer • Updated Aug 26, 2024 • 2.55M • 12.8k • 305

HuggingFaceM4/the_cauldron

Viewer • Updated May 6, 2024 • 1.88M • 302k • 546

HuggingFaceM4/FairFace

Viewer • Updated Apr 11, 2024 • 195k • 1.99k • 31

HuggingFaceM4/MMBench

Viewer • Updated Apr 5, 2024 • 11k • 284 • 4

HuggingFaceM4/WebSight

Viewer • Updated Mar 26, 2024 • 2.75M • 11.1k • 395
