Models: 426

Active filters: vision-language

Mattimax/DATA-AI_Smol256M-Instruct

0.3B • Updated Feb 16 • 7

ctranslate2-4you/GOT-OCR2_0-Customized

Image-Text-to-Text • 0.7B • Updated Feb 17 • 3

sbintuitions/sarashina2-vision-8b

Image-to-Text • 8B • Updated Mar 27 • 340 • 7

sbintuitions/sarashina2-vision-14b

Image-to-Text • 14B • Updated Mar 27 • 240 • 9

UMA-IA/AQUILA-Engine-v1

Image-to-Text • 8B • Updated Mar 16 • 5 • 1

aihpi/food-waste-vlm

8B • Updated Apr 1 • 6

jpark677/internvl2-8b-mmbench-lora-ep-1-waa-false

Image-to-Text • 8B • Updated Apr 3 • 7

jpark677/internvl2-8b-mmbench-lora-ep-2-waa-false

Image-to-Text • 8B • Updated Apr 3 • 6

mradermacher/SpaceQwen2.5-VL-3B-Instruct-GGUF

Robotics • 3B • Updated Jul 31 • 119 • 1

mradermacher/SpaceQwen2.5-VL-3B-Instruct-i1-GGUF

Robotics • 3B • Updated Jul 11 • 253 • 1

mradermacher/AQUILA-Engine-v1-GGUF

8B • Updated Jul 31 • 50

mradermacher/AQUILA-Engine-v1-i1-GGUF

8B • Updated Jul 11 • 152

TheEighthDay/SeekWorld_RL_PLUS

8B • Updated Apr 19 • 241 • 1

mradermacher/SeekWorld_RL_PLUS-GGUF

8B • Updated Jul 31 • 46

nkkbr/ViCA-ARKitScenes

Video-Text-to-Text • 8B • Updated May 7 • 6

nkkbr/ViCA-ScanNet

Video-Text-to-Text • 8B • Updated May 7 • 10

nkkbr/ViCA-base

Video-Text-to-Text • 8B • Updated May 7 • 4

nkkbr/ViCA

Video-Text-to-Text • 8B • Updated May 28 • 6

nkkbr/ViCA-ScanNetPP

Video-Text-to-Text • 8B • Updated May 7 • 5

nkkbr/ViCA2-stage1-align

Video-Text-to-Text • 8B • Updated May 15 • 5

nkkbr/ViCA2-stage2-onevision-ft

Video-Text-to-Text • 8B • Updated May 15 • 7

nkkbr/ViCA2

Video-Text-to-Text • 8B • Updated May 28 • 9

nkkbr/ViCA2-init

Video-Text-to-Text • 8B • Updated May 15 • 5

remyxai/SpaceOm

Image-Text-to-Text • 4B • Updated Jul 6 • 8.17k • 12

ChongyuWang/ShowUI_Grounding_Qwen_2B_pretrained

Updated Apr 26 • 7

kevin510/friday

Text Generation • 4B • Updated Jun 19 • 5

yemalin/furniture-captioner

0.2B • Updated May 4

ragunath-ravi/blip-histopathology-finetuned

Image-to-Text • 0.2B • Updated May 4 • 25 • 3

nkkbr/ViCA2-thinkng

Video-Text-to-Text • 8B • Updated May 15 • 9

nkkbr/ViCA-thinking

Video-Text-to-Text • 8B • Updated May 7 • 7