Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
merve
's Collections
Releases July 25
Releases July 18
Releases July 11
Releases July 4
Releases June 27
June 20 Releases
OCR Models & Datasets
Releases June 13
Releases June 6
Releases 30 May
Releases 23 May
May 16 Releases
May 9 Releases
Any-to-Any Models, Datasets, Spaces
Releases Apr 21 & May 2
InternVL3 HF
April 16 Releases
Multimodal DSE Retrievers
April 11 Releases
March 28 Releases
March 21 Releases
Türkçe VLMler
Feb 14 Releases 💌
Feb 7 Releases 🧣
January 31 Releases 🧤
Models, Jan 27
Jan 24 Releases
Jan 17 Releases ❄️
Jan 10 Releases 🌨️
Dec 6 Releases 🎄
Nov 29 Releases 🌲🌲
Nov 22 Releases ❄️
Nov 15 Releases 🍂
Nov 1 Releases
MIT Talk 31/10 Papers
October 25 Releases
LOTUS 🪷
New Depth Models
BRAVE Models 🦁
Computer Vision Backbones 🧩
Image Classification Models 🐶 🐱
Object Detection Models 🥥
Image Segmentation Models 💜
Zero-shot Image Classification Models 🖼️
Image-to-Image Models 🎨
Video Classification Models 📺
Image-to-Text Models 📝
Text-to-Image Models 🥑
Foundation Models for Vision 🧩
Segment Anything Model
OWL-series 🦉
SigLIP
Awesome Document AI
SegGPT
Vision Language Models Papers 🖼️💬📝
gvhf/owl
gv-hf/owl
merve/owl2
Depth Anything v2 Release
Document VLM Papers
Vision Language Leaderboards
Video Language Models
SAM2
NVEagle
Multimodal RAG
Zero-shot Segmentation
Feb 14 Releases 💌
updated
Feb 14
Upvote
7
OpenGVLab/InternVideo2_5_Chat_8B
Video-Text-to-Text
•
8B
•
Updated
Feb 18
•
23.3k
•
73
AIDC-AI/Ovis2-34B
Image-Text-to-Text
•
35B
•
Updated
Feb 27
•
630
•
150
open-r1/OpenR1-Qwen-7B
Text Generation
•
8B
•
Updated
May 28
•
1.05k
•
•
53
nomic-ai/nomic-embed-text-v2-moe
Sentence Similarity
•
0.5B
•
Updated
Apr 1
•
152k
•
416
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech
•
Updated
Jun 3
•
27.9k
•
•
1.09k
agentica-org/DeepScaleR-1.5B-Preview
Text Generation
•
2B
•
Updated
Apr 9
•
45.7k
•
568
open-r1/OpenR1-Math-Raw
Viewer
•
Updated
Feb 24
•
516k
•
162
•
74
open-r1/OpenR1-Math-220k
Viewer
•
Updated
Feb 18
•
450k
•
28.3k
•
622
Zyphra/Zonos-v0.1-transformer
Text-to-Speech
•
Updated
Jun 3
•
52.9k
•
•
410
AIDC-AI/Ovis2-1B
Image-Text-to-Text
•
1B
•
Updated
Feb 27
•
386k
•
89
AIDC-AI/Ovis2-16B
Image-Text-to-Text
•
16B
•
Updated
Feb 27
•
30k
•
99
AIDC-AI/Ovis2-2B
Image-Text-to-Text
•
2B
•
Updated
Feb 27
•
4.89k
•
59
AIDC-AI/Ovis2-8B
Image-Text-to-Text
•
9B
•
Updated
Feb 27
•
18k
•
72
AIDC-AI/Ovis2-4B
Image-Text-to-Text
•
5B
•
Updated
Feb 27
•
5.96k
•
60
sbintuitions/modernbert-ja-130m
Fill-Mask
•
0.1B
•
Updated
May 1
•
2.91k
•
•
45
Zyphra/Zonos-v0.1-speaker-embedding
Updated
Feb 12
•
28
GAIR/LIMO
33B
•
Updated
Feb 6
•
920
•
43
prithivMLmods/Hoags-2B-Exp
Image-Text-to-Text
•
2B
•
Updated
Feb 15
•
3
•
3
Metric-AI/ColQwenStella-2b-multilingual
Visual Document Retrieval
•
Updated
Mar 25
•
3
•
9
apple/DepthPro-hf
Depth Estimation
•
1.0B
•
Updated
Feb 28
•
19.2k
•
60
Liberata/illustrious-xl-v1.0
Text-to-Image
•
Updated
Feb 12
•
143
OpenGVLab/InternVL_2_5_HiCo_R16
Video-Text-to-Text
•
8B
•
Updated
Feb 13
•
2.94k
•
4
OpenGVLab/InternVL_2_5_HiCo_R64
Video-Text-to-Text
•
8B
•
Updated
May 13
•
178
•
3
Upvote
7
+3
Share collection
View history
Collection guide
Browse collections