Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Replicate
SambaNova
Novita
Fireworks
Cohere
Nebius AI Studio
Together AI
fal
Cerebras
Hyperbolic
HF Inference API
Misc
Reset Misc
Inference Endpoints
custom_code
AutoTrain Compatible
visual-question-answering
text-generation-inference
4-bit precision
8-bit precision
Merge
Mixture of Experts
Misc with no match
Eval Results
text-embeddings-inference
Carbon Emissions
Apply filters
Models
566
Full-text search
Edit filters
Sort: Trending
Active filters:
visual-question-answering
Clear all
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition
•
Updated
8 days ago
•
688k
•
1.3k
DAMO-NLP-SG/VideoLLaMA3-7B
Visual Question Answering
•
Updated
28 days ago
•
42.2k
•
50
TIGER-Lab/VL-Rethinker-7B
Visual Question Answering
•
Updated
2 days ago
•
165
•
5
Salesforce/blip2-opt-2.7b
Image-Text-to-Text
•
Updated
Feb 3
•
713k
•
357
microsoft/OmniParser
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
933
•
1.66k
LZXzju/Qwen2.5-VL-3B-UI-R1
Visual Question Answering
•
Updated
17 days ago
•
74
•
6
dandelin/vilt-b32-finetuned-vqa
Visual Question Answering
•
Updated
Aug 2, 2022
•
77.4k
•
407
Salesforce/blip-vqa-base
Visual Question Answering
•
Updated
Feb 3
•
1.72M
•
152
Salesforce/blip-vqa-capfilt-large
Visual Question Answering
•
Updated
Feb 3
•
54.5k
•
51
Salesforce/blip2-flan-t5-xl
Image-Text-to-Text
•
Updated
Feb 3
•
112k
•
66
Salesforce/blip2-opt-6.7b
Image-Text-to-Text
•
Updated
Feb 3
•
6.27k
•
75
Salesforce/blip2-flan-t5-xxl
Image-Text-to-Text
•
Updated
Feb 3
•
7.21k
•
88
google/pix2struct-docvqa-base
Visual Question Answering
•
Updated
Dec 24, 2023
•
8.69k
•
38
google/pix2struct-infographics-vqa-large
Visual Question Answering
•
Updated
May 19, 2023
•
131
•
10
google/pix2struct-screen2words-base
Visual Question Answering
•
Updated
May 19, 2023
•
195
•
24
google/pix2struct-screen2words-large
Visual Question Answering
•
Updated
May 19, 2023
•
129
•
19
google/matcha-chart2text-pew
Visual Question Answering
•
Updated
Jul 22, 2023
•
243
•
39
google/matcha-chartqa
Visual Question Answering
•
Updated
Jul 22, 2023
•
964
•
41
google/matcha-base
Visual Question Answering
•
Updated
Jul 22, 2023
•
1.27k
•
26
google/deplot
Visual Question Answering
•
Updated
Sep 6, 2023
•
7.57k
•
298
IDEA-CCNL/Ziya-BLIP2-14B-Visual-v1
Visual Question Answering
•
Updated
Jun 7, 2023
•
55
•
57
paragon-AI/blip2-image-to-text
Image-to-Text
•
Updated
Jun 24, 2023
•
288
•
27
Gregor/mblip-mt0-xl
Image-to-Text
•
Updated
May 7, 2024
•
1.57k
•
14
merve/blip2-opt-6.7b
Image-to-Text
•
Updated
Oct 4, 2023
•
75
•
2
dineshcr7/med-VQA-1
Visual Question Answering
•
Updated
Oct 28, 2023
•
83
•
1
kpyu/eilev-blip2-opt-2.7b
Image-to-Text
•
Updated
Oct 22, 2024
•
290
•
4
unum-cloud/uform-gen
Image-to-Text
•
Updated
Dec 31, 2023
•
176
•
44
internlm/internlm-xcomposer2-vl-7b
Visual Question Answering
•
Updated
Apr 12, 2024
•
2.1k
•
82
openbmb/MiniCPM-V
Visual Question Answering
•
Updated
Jan 15
•
31.3k
•
173
openbmb/OmniLMM-12B
Visual Question Answering
•
Updated
Apr 16, 2024
•
256
•
72
Previous
1
2
3
...
19
Next