Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
merve
's Collections
January 31 Releases ๐งค
Models, Jan 27
Jan 24 Releases
Jan 17 Releases โ๏ธ
Jan 10 Releases ๐จ๏ธ
Dec 6 Releases ๐
Nov 29 Releases ๐ฒ๐ฒ
Nov 22 Releases โ๏ธ
Nov 15 Releases ๐
Nov 1 Releases
MIT Talk 31/10 Papers
October 25 Releases
LOTUS ๐ชท
New Depth Models
BRAVE Models ๐ฆ
Computer Vision Backbones ๐งฉ
Image Classification Models ๐ถ ๐ฑ
Object Detection Models ๐ฅฅ
Image Segmentation Models ๐
Zero-shot Image Classification Models ๐ผ๏ธ
Image-to-Image Models ๐จ
Video Classification Models ๐บ
Image-to-Text Models ๐
Text-to-Image Models ๐ฅ
Foundation Models for Vision ๐งฉ
Segment Anything Model
OWL-series ๐ฆ
SigLIP
Awesome Document AI
SegGPT
Vision Language Models Papers ๐ผ๏ธ๐ฌ๐
gvhf/owl
gv-hf/owl
merve/owl2
Depth Anything v2 Release
Document VLM Papers
Vision Language Leaderboards
Video Language Models
SAM2
NVEagle
Multimodal RAG
Zero-shot Segmentation
Models, Jan 27
updated
3 days ago
Upvote
1
Running
on
Zero
246
๐ฅ
Qwen2-VL-7B
Running
27
๐
UI-TARS
Running
51
๐ป
Qwen2.5-1M Demo
Qwen/Qwen2.5-14B-Instruct-1M
Text Generation
โข
Updated
1 day ago
โข
3.85k
โข
181
Qwen/Qwen2.5-7B-Instruct-1M
Text Generation
โข
Updated
1 day ago
โข
15.2k
โข
142
bytedance-research/UI-TARS-72B-DPO
Image-Text-to-Text
โข
Updated
5 days ago
โข
6.1k
โข
74
bytedance-research/UI-TARS-72B-SFT
Image-Text-to-Text
โข
Updated
5 days ago
โข
270
โข
10
bytedance-research/UI-TARS-7B-SFT
Image-Text-to-Text
โข
Updated
5 days ago
โข
2.56k
โข
124
bytedance-research/UI-TARS-7B-DPO
Image-Text-to-Text
โข
Updated
5 days ago
โข
15.3k
โข
98
bytedance-research/UI-TARS-2B-SFT
Image-Text-to-Text
โข
Updated
5 days ago
โข
5.19k
โข
13
openbmb/MiniCPM-o-2_6
Any-to-Any
โข
Updated
4 days ago
โข
169k
โข
873
tencent/HunyuanVideo
Text-to-Video
โข
Updated
9 days ago
โข
7.74k
โข
1.54k
Upvote
1
Share collection
View history
Collection guide
Browse collections