Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
merve
's Collections
January 31 Releases ๐งค
Models, Jan 27
Jan 24 Releases
Jan 17 Releases โ๏ธ
Jan 10 Releases ๐จ๏ธ
Dec 6 Releases ๐
Nov 29 Releases ๐ฒ๐ฒ
Nov 22 Releases โ๏ธ
Nov 15 Releases ๐
Nov 1 Releases
MIT Talk 31/10 Papers
October 25 Releases
LOTUS ๐ชท
New Depth Models
BRAVE Models ๐ฆ
Computer Vision Backbones ๐งฉ
Image Classification Models ๐ถ ๐ฑ
Object Detection Models ๐ฅฅ
Image Segmentation Models ๐
Zero-shot Image Classification Models ๐ผ๏ธ
Image-to-Image Models ๐จ
Video Classification Models ๐บ
Image-to-Text Models ๐
Text-to-Image Models ๐ฅ
Foundation Models for Vision ๐งฉ
Segment Anything Model
OWL-series ๐ฆ
SigLIP
Awesome Document AI
SegGPT
Vision Language Models Papers ๐ผ๏ธ๐ฌ๐
gvhf/owl
gv-hf/owl
merve/owl2
Depth Anything v2 Release
Document VLM Papers
Vision Language Leaderboards
Video Language Models
SAM2
NVEagle
Multimodal RAG
Zero-shot Segmentation
Video Language Models
updated
Aug 1, 2024
A collection of video-language models
Upvote
2
Running
20
๐จ
Video Llava
llava-hf/LLaVA-NeXT-Video-7B-hf
Video-Text-to-Text
โข
Updated
2 days ago
โข
43.2k
โข
62
llava-hf/LLaVA-NeXT-Video-7B-DPO-hf
Video-Text-to-Text
โข
Updated
2 days ago
โข
1.13k
โข
9
llava-hf/LLaVA-NeXT-Video-7B-32K-hf
Image-Text-to-Text
โข
Updated
2 days ago
โข
295
โข
7
Running
on
Zero
30
๐
Llava Interleave
Upvote
2
Share collection
View history
Collection guide
Browse collections