Hugging Face
Models
10,060
Sort: Trending
Active filters:
image-text-to-text
Qwen/Qwen2.5-VL-7B-Instruct-AWQ • Image-Text-to-Text • Updated Apr 6 • 172k • 71
TIGER-Lab/VLM2Vec-Qwen2VL-7B • Image-Text-to-Text • Updated May 3 • 569 • 6
google/gemma-3-12b-pt • Image-Text-to-Text • Updated Mar 21 • 19.5k • 52
unsloth/gemma-3-27b-it-GGUF • Image-Text-to-Text • Updated 25 days ago • 48.9k • 126
mlabonne/gemma-3-4b-it-abliterated-GGUF • Image-Text-to-Text • Updated Mar 17 • 1.69k • 19
Mungert/Qwen2.5-VL-7B-Instruct-GGUF • Image-Text-to-Text • Updated 25 days ago • 24.9k • 16
Mungert/Qwen2.5-VL-32B-Instruct-GGUF • Image-Text-to-Text • Updated 25 days ago • 3.35k • 9
meta-llama/Llama-4-Maverick-17B-128E • Image-Text-to-Text • Updated Apr 9 • 1.36k • 76
gaunernst/gemma-3-27b-it-qat-autoawq • Image-Text-to-Text • Updated Apr 20 • 1.24k • 7
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF • Image-Text-to-Text • Updated 8 days ago • 44.2k • 23
OpenGVLab/InternVL3-8B • Image-Text-to-Text • Updated 8 days ago • 240k • 64
OpenGVLab/InternVL3-2B • Image-Text-to-Text • Updated 8 days ago • 170k • 27
starriver030515/FUSION-X-Phi3.5-3B • Image-Text-to-Text • Updated Apr 15 • 60 • 2
remyxai/SpaceThinker-Qwen2.5VL-3B • Image-Text-to-Text • Updated about 8 hours ago • 2.02k • 14
Skywork/Skywork-R1V2-38B-AWQ • Image-Text-to-Text • Updated Apr 28 • 258 • 10
prithivMLmods/docscopeOCR-7B-050425-exp • Image-Text-to-Text • Updated 5 days ago • 573 • 2
hal-utokyo/MangaLMM • Image-Text-to-Text • Updated 5 days ago • 634 • 4
rp-yu/Dimple-7B • Image-Text-to-Text • Updated 11 days ago • 731 • 6
ChenShawn/DeepEyes-7B • Image-Text-to-Text • Updated 15 days ago • 473 • 2
BSC-LT/salamandra-7b-vision • Image-Text-to-Text • Updated 16 days ago • 17 • 2
WaltonFuture/Qwen2.5-VL-7B-MM-UPT-MMR1 • Image-Text-to-Text • Updated 1 day ago • 26 • 2
kingabzpro/medgemma-brain-cancer • Image-Text-to-Text • Updated 9 days ago • 7
mlabonne/gemma-3-12b-it-abliterated-v2 • Image-Text-to-Text • Updated 8 days ago • 165 • 3
mlabonne/gemma-3-4b-it-abliterated-v2-GGUF • Image-Text-to-Text • Updated 8 days ago • 893 • 3
lmstudio-community/medgemma-4b-it-GGUF • Image-Text-to-Text • Updated 8 days ago • 980 • 4
OpenGVLab/ZeroGUI-AndroidLab-7B • Image-Text-to-Text • Updated 7 days ago • 33 • 2
Mungert/medgemma-4b-it-GGUF • Image-Text-to-Text • Updated 7 days ago • 728 • 2
VIDraft/Gemma-3-R1984-4B • Image-Text-to-Text • Updated Apr 10 • 701 • 19
Akajackson/donut_rus • Image-Text-to-Text • Updated Apr 27, 2023 • 638 • 11
Salesforce/instructblip-vicuna-7b • Image-Text-to-Text • Updated Feb 3 • 17.1k • 94