Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

1,364

Full-text search

Active filters: multimodal

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 5.06M • • 1.08k

NCSOFT/VARCO-VISION-2.0-14B

Image-Text-to-Text • 15B • Updated 2 days ago • 6.95k • 25

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 3.86M • 466

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 115k • 1.71k

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 92.4k • 325

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14 • 474k • • 410

stepfun-ai/Step1X-Edit

Image-to-Image • Updated 15 days ago • 272 • • 306

Qwen/Qwen2.5-Omni-3B

Any-to-Any • 6B • Updated Apr 30 • 180k • 256

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated 8 days ago • 16k • 436

Kwai-Keye/Keye-VL-8B-Preview

Video-Text-to-Text • 9B • Updated 17 days ago • 35.7k • 68

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 454k • • 516

lingshu-medical-mllm/Lingshu-7B

Image-Text-to-Text • 8B • Updated 29 days ago • 6.19k • 47

DocReRank/DocReRank-Reranker

Visual Document Retrieval • Updated 1 day ago • 4

robotics-diffusion-transformer/rdt-1b

Robotics • Updated Oct 17, 2024 • 760 • 89

jinaai/jina-clip-v2

Feature Extraction • 0.9B • Updated Apr 28 • 28.7k • • 264

NCSOFT/GME-VARCO-VISION-Embedding

Feature Extraction • 8B • Updated 6 days ago • 1.17k • 9

NCSOFT/VARCO-VISION-2.0-1.7B

Image-Text-to-Text • Updated 8 days ago • 8

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • 8B • Updated Dec 2, 2024 • 47.7k • 289

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 33.7k • 105

Qwen/Qwen2.5-VL-3B-Instruct-AWQ

Image-Text-to-Text • 1B • Updated Apr 6 • 24.5k • 46

Qwen/Qwen2.5-VL-32B-Instruct-AWQ

Image-Text-to-Text • 6B • Updated Apr 6 • 66.1k • 52

openbmb/AgentCPM-GUI

Image-Text-to-Text • 8B • Updated Jun 14 • 531 • 123

unsloth/Qwen2.5-Omni-7B-GGUF

Any-to-Any • 8B • Updated May 28 • 11.7k • 19

FreedomIntelligence/Janus-4o-7B

Any-to-Any • 7B • Updated 23 days ago • 1.8k • 40

imageomics/bioclip

Zero-Shot Image Classification • Updated May 17, 2024 • 164k • 51

Goekdeniz-Guelmez/J.O.S.I.E.v4o

Any-to-Any • Updated Oct 29, 2024 • 26

openvla/openvla-7b

Image-Text-to-Text • 8B • Updated Sep 16, 2024 • 449k • 130

qnguyen3/nanoLLaVA-1.5

Image-Text-to-Text • 1B • Updated Sep 21, 2024 • 113 • 111

lmms-lab/llava-onevision-qwen2-0.5b-si

Text Generation • 0.9B • Updated Sep 2, 2024 • 3.13k • 14

lmms-lab/llava-onevision-qwen2-0.5b-ov

Text Generation • 0.9B • Updated Sep 2, 2024 • 39.5k • 20