Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
merve
's Collections
Releases July 18
Releases July 11
Releases July 4
Releases June 27
June 20 Releases
OCR Models & Datasets
Releases June 13
Releases June 6
Releases 30 May
Releases 23 May
May 16 Releases
May 9 Releases
Any-to-Any Models, Datasets, Spaces
Releases Apr 21 & May 2
InternVL3 HF
April 16 Releases
Multimodal DSE Retrievers
April 11 Releases
March 28 Releases
March 21 Releases
Türkçe VLMler
Feb 14 Releases 💌
Feb 7 Releases 🧣
January 31 Releases 🧤
Models, Jan 27
Jan 24 Releases
Jan 17 Releases ❄️
Jan 10 Releases 🌨️
Dec 6 Releases 🎄
Nov 29 Releases 🌲🌲
Nov 22 Releases ❄️
Nov 15 Releases 🍂
Nov 1 Releases
MIT Talk 31/10 Papers
October 25 Releases
LOTUS 🪷
New Depth Models
BRAVE Models 🦁
Computer Vision Backbones 🧩
Image Classification Models 🐶 🐱
Object Detection Models 🥥
Image Segmentation Models 💜
Zero-shot Image Classification Models 🖼️
Image-to-Image Models 🎨
Video Classification Models 📺
Image-to-Text Models 📝
Text-to-Image Models 🥑
Foundation Models for Vision 🧩
Segment Anything Model
OWL-series 🦉
SigLIP
Awesome Document AI
SegGPT
Vision Language Models Papers 🖼️💬📝
gvhf/owl
gv-hf/owl
merve/owl2
Depth Anything v2 Release
Document VLM Papers
Vision Language Leaderboards
Video Language Models
SAM2
NVEagle
Multimodal RAG
Zero-shot Segmentation
October 25 Releases
updated
Oct 25, 2024
Upvote
7
ibm-granite/granite-3.0-8b-instruct
Text Generation
•
8B
•
Updated
Dec 19, 2024
•
19.6k
•
202
ibm-granite/granite-3.0-2b-instruct
Text Generation
•
3B
•
Updated
Dec 19, 2024
•
4.89k
•
46
CohereLabs/aya-expanse-8b
Text Generation
•
8B
•
Updated
Apr 15
•
20.3k
•
•
385
CohereLabs/aya-expanse-32b
Text Generation
•
32B
•
Updated
Jun 12
•
6.89k
•
•
262
genmo/mochi-1-preview
Text-to-Video
•
Updated
Dec 18, 2024
•
12.5k
•
•
1.24k
rhymes-ai/Allegro
Text-to-Video
•
Updated
Oct 31, 2024
•
240
•
261
LanguageBind/Open-Sora-Plan-v1.3.0
Text-to-Video
•
Updated
Dec 5, 2024
•
9
•
71
jadechoghari/Ferret-UI-Llama8b
Image-Text-to-Text
•
8B
•
Updated
Jan 8
•
329
•
69
jadechoghari/Ferret-UI-Gemma2b
Image-Text-to-Text
•
3B
•
Updated
Oct 18, 2024
•
305
•
50
microsoft/OmniParser
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
1.3k
•
1.68k
neuralwork/arxiver
Viewer
•
Updated
Nov 1, 2024
•
63.4k
•
173
•
362
neulab/Pangea-7B
8B
•
Updated
Oct 24, 2024
•
19.4k
•
129
neulab/Pangea-7B-hf
8B
•
Updated
Oct 28, 2024
•
1.05k
•
9
Running
49
49
Pangea
🚀
A Fully Open Multilingual Multimodal LLM for 39 Languages
stabilityai/stable-diffusion-3.5-large
Text-to-Image
•
Updated
Oct 22, 2024
•
121k
•
•
3.02k
stabilityai/stable-diffusion-3.5-large-turbo
Text-to-Image
•
Updated
Oct 22, 2024
•
20.7k
•
•
612
Marqo/marqo-GS-10M
Viewer
•
Updated
Oct 23, 2024
•
9.81M
•
677
•
49
vikhyatk/lofi
Viewer
•
Updated
Oct 26, 2024
•
857k
•
2.01k
•
81
neulab/PangeaInstruct
Updated
Feb 2
•
357
•
85
Upvote
7
+3
Share collection
View history
Collection guide
Browse collections