Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
merve
's Collections
Releases Apr 21 & May 2
InternVL3 HF
April 16 Releases
Multimodal DSE Retrievers
April 11 Releases
March 28 Releases
March 21 Releases
TΓΌrkΓ§e VLMler
Feb 14 Releases π
Feb 7 Releases π§£
January 31 Releases π§€
Models, Jan 27
Jan 24 Releases
Jan 17 Releases βοΈ
Jan 10 Releases π¨οΈ
Dec 6 Releases π
Nov 29 Releases π²π²
Nov 22 Releases βοΈ
Nov 15 Releases π
Nov 1 Releases
MIT Talk 31/10 Papers
October 25 Releases
LOTUS πͺ·
New Depth Models
BRAVE Models π¦
Computer Vision Backbones π§©
Image Classification Models πΆ π±
Object Detection Models π₯₯
Image Segmentation Models π
Zero-shot Image Classification Models πΌοΈ
Image-to-Image Models π¨
Video Classification Models πΊ
Image-to-Text Models π
Text-to-Image Models π₯
Foundation Models for Vision π§©
Segment Anything Model
OWL-series π¦
SigLIP
Awesome Document AI
SegGPT
Vision Language Models Papers πΌοΈπ¬π
gvhf/owl
gv-hf/owl
merve/owl2
Depth Anything v2 Release
Document VLM Papers
Vision Language Leaderboards
Video Language Models
SAM2
NVEagle
Multimodal RAG
Zero-shot Segmentation
Multimodal DSE Retrievers
updated
25 days ago
A collection of DSE models for multimodal retrieval
Upvote
14
+4
racineai/Flantier-SmolVLM-2B-dse
Updated
Mar 26
β’
10
β’
7
MrLight/dse-qwen2-2b-mrl-v1
Visual Document Retrieval
β’
Updated
Feb 26
β’
4.76k
β’
58
marco/mcdse-2b-v1
Updated
Oct 29, 2024
β’
3.78k
β’
54
llamaindex/vdr-2b-multi-v1
Image-Text-to-Text
β’
Updated
30 days ago
β’
4.68k
β’
113
llamaindex/vdr-2b-v1
Image-Text-to-Text
β’
Updated
Jan 10
β’
465
β’
13
Upvote
14
+10
Share collection
View history
Collection guide
Browse collections