HuggingFaceM4

company

AI & ML interests

None defined yet.

Organization Card

Community About org cards

HuggingFaceM4 is the multimodal team at Hugging Face, working on vision-language models.

Within this organization on the Hugging Face hub, you can access the Idefics models (version 1 IDEFICS, version 2 Idefics2, version 3 Idefics3), datasets used for the training like OBELICS, WebSight, The Cauldron or Docmatix, and interactive tools to visualize the results.

Collections 5

View 5 collections

spaces 20

IDEFICS Playground

faster-qwen3-tts

Generate natural speech from text or voice samples

Reachy Mini Remote Control (Multi-User)

Remote control for Reachy Mini robots with authentication

Reachy Mini Key Claim

Request an ephemeral API key using an order number

Gradium Setup

Little space to improve the onboarding to gradium

FineVision: Open Data is All You Need

A new open-source dataset for training VLMs

models 34

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • 8B • Updated Dec 2, 2024 • 414k • 304

HuggingFaceM4/Florence-2-DocVQA

Image-Text-to-Text • 0.8B • Updated Oct 30, 2024 • 621 • 65

HuggingFaceM4/idefics2-8b

Image-Text-to-Text • 8B • Updated Oct 14, 2024 • 133k • 623

HuggingFaceM4/idefics2-8b-base

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 1.05k • 28

HuggingFaceM4/idefics2-8b-chatty

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 174 • 95

HuggingFaceM4/siglip-so400m-14-364-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated Jul 27, 2024 • 14 • 1

HuggingFaceM4/siglip-so400m-14-700-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated Jun 13, 2024 • 13 • 2

HuggingFaceM4/siglip-so400m-14-384-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated May 9, 2024 • 12 • 1

HuggingFaceM4/idefics2-8b-chatty-AWQ

Image-Text-to-Text • 8B • Updated May 6, 2024 • 7 • 5

HuggingFaceM4/idefics2-8b-AWQ

Image-Text-to-Text • 8B • Updated May 6, 2024 • 16 • 26

datasets 82

HuggingFaceM4/FineVisionMax

Viewer • Updated Oct 21, 2025 • 24.2M • 12.4k • 27

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 175k • 489

HuggingFaceM4/lmms-eval-embeddings

Updated Sep 3, 2025 • 446 • 1

HuggingFaceM4/DoclingMatix

Viewer • Updated Jul 31, 2025 • 1.27M • 2.3k • 52

HuggingFaceM4/Caltech-101

Updated Sep 10, 2024 • 206 • 4

HuggingFaceM4/Docmatix

Viewer • Updated Aug 26, 2024 • 2.55M • 12.6k • 304

HuggingFaceM4/the_cauldron

Viewer • Updated May 6, 2024 • 1.88M • 598k • 545

HuggingFaceM4/FairFace

Viewer • Updated Apr 11, 2024 • 195k • 2.89k • 30

HuggingFaceM4/MMBench

Viewer • Updated Apr 5, 2024 • 11k • 517 • 4

HuggingFaceM4/WebSight

Viewer • Updated Mar 26, 2024 • 2.75M • 15.3k • 394

View 82 datasets