Models: 8,159

Active filters: image-text-to-text

baidu/ERNIE-4.5-VL-28B-A3B-Thinking

Image-Text-to-Text • 30B • Updated 1 day ago • 50 • 187

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated 8 days ago • 3.34M • 2.62k

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated about 2 hours ago • 37.4k • 1.25k

jzhang533/PaddleOCR-VL-For-Manga

Image-Text-to-Text • 1.0B • Updated 6 days ago • 111 • 44

Qwen/Qwen3-VL-8B-Instruct

Image-Text-to-Text • 9B • Updated 27 days ago • 1.55M • 416

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • 31B • Updated Oct 9 • 2.53M • 376

unsloth/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated 2 days ago • 5.79k • 24

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated 19 days ago • 247k • 176

google/gemma-3-4b-it

Image-Text-to-Text • 4B • Updated Mar 21 • 933k • 949

ibm-granite/granite-docling-258M

Image-Text-to-Text • 0.3B • Updated Sep 23 • 111k • 1.01k

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated 11 days ago • 1.06M • 1.12k

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated 17 days ago • 43.7k • 689

Qwen/Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated 27 days ago • 564k • 226

huihui-ai/Huihui-Qwen3-VL-8B-Instruct-abliterated

Image-Text-to-Text • 9B • Updated 10 days ago • 28.8k • 69

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 5M • 1.34k

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 837k • 1.68k

google/medgemma-4b-it

Image-Text-to-Text • 4B • Updated 14 days ago • 451k • 748

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16

Image-Text-to-Text • 13B • Updated 1 day ago • 11.1k • 47

unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF

Image-Text-to-Text • 31B • Updated about 3 hours ago • 43.8k • 26

google/gemma-3n-E4B-it

Image-Text-to-Text • 8B • Updated Jul 14 • 43.2k • 815

nanonets/Nanonets-OCR2-3B

Image-Text-to-Text • 4B • Updated 26 days ago • 90.5k • 439

Jalea96/DeepSeek-OCR-bnb-4bit-NF4

Image-Text-to-Text • 3B • Updated 14 days ago • 4.72k • 13

mlx-community/DeepSeek-OCR-8bit

Image-Text-to-Text • 1B • Updated 15 days ago • 6.33k • 18

Qwen/Qwen3-VL-8B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated 10 days ago • 5.88k • 9

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 8.33M • 551

unsloth/Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 12 • 82.9k • 97

ZJU-AI4H/Hulu-Med-7B

Image-Text-to-Text • 8B • Updated 9 days ago • 1.89k • 34

Qwen/Qwen3-VL-4B-Thinking

Image-Text-to-Text • 4B • Updated 27 days ago • 48.2k • 79

Qwen/Qwen3-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated 21 days ago • 999k • 116

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-FP8

Image-Text-to-Text • 13B • Updated 1 day ago • 15.8k • 38