carlizor's Collections
Image Vision
Updated 2 days ago
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 • Image-Text-to-Text • Updated Feb 3 • 1.17k • 185
AIDC-AI/Ovis1.6-Gemma2-9B • Image-Text-to-Text • Updated 16 days ago • 6.58k • 269
nvidia/NVLM-D-72B • Image-Text-to-Text • Updated Jan 14 • 22.3k • 764
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 2.61k • 1.64k
deepseek-ai/Janus-1.3B • Any-to-Any • Updated Jan 27 • 28.4k • 580
deepseek-ai/JanusFlow-1.3B • Any-to-Any • Updated Jan 27 • 3.8k • 144
NexaAIDev/OmniVLM-968M • Updated Dec 17, 2024 • 1.39k • 513
vikhyatk/moondream2 • Image-Text-to-Text • Updated Jan 9 • 133k • 1.07k
stepfun-ai/GOT-OCR2_0 • Image-Text-to-Text • Updated Feb 4 • 76.9k • 1.42k
jiuhai/florence-vl-8b-sft • Updated Dec 3, 2024 • 36 • 19
AI-Safeguard/Ivy-VL-llava • Visual Question Answering • Updated Dec 31, 2024 • 363 • 62
OpenGVLab/InternVL2_5-78B • Image-Text-to-Text • Updated Feb 5 • 5.09k • 181
Qwen/QVQ-72B-Preview • Image-Text-to-Text • Updated Jan 12 • 178k • 563
deepseek-ai/deepseek-vl2 • Image-Text-to-Text • Updated Dec 18, 2024 • 16.6k • 301
allenai/Molmo-7B-D-0924 • Image-Text-to-Text • Updated Oct 10, 2024 • 99k • 514
prithivMLmods/Qwen2-VL-OCR-2B-Instruct • Image-Text-to-Text • Updated Jan 11 • 42.1k • 64
ByteDance/Sa2VA-1B • Image-Text-to-Text • Updated Jan 20 • 1.77k • 20
HuggingFaceTB/SmolVLM-500M-Instruct • Image-Text-to-Text • Updated 8 days ago • 25.5k • 114
Qwen/Qwen2.5-VL-72B-Instruct • Image-Text-to-Text • Updated 7 days ago • 293k • 370
Qwen/Qwen2.5-VL-7B-Instruct • Image-Text-to-Text • Updated 8 days ago • 3.38M • 664
OpenGVLab/InternVideo2_5_Chat_8B • Video-Text-to-Text • Updated 24 days ago • 16.1k • 44
nvidia/Eagle2-9B • Image-Text-to-Text • Updated Jan 28 • 1k • 45
stepfun-ai/GOT-OCR-2.0-hf • Image-Text-to-Text • Updated Jan 31 • 189k • 171
allenai/olmOCR-7B-0225-preview • Image-Text-to-Text • Updated 17 days ago • 193k • 543
microsoft/Magma-8B • Image-Text-to-Text • Updated 8 days ago • 12.3k • 330
marco/mcdse-2b-v1 • Updated Oct 29, 2024 • 6.49k • 54