LLaVA-OneVision
Updated Oct 5, 2024
A model that handles arbitrary types of visual input
LLaVA-OneVision: Easy Visual Task Transfer • Paper • 2408.03326 • Published Aug 6, 2024 • 60
lmms-lab/LLaVA-OneVision-Mid-Data • Viewer • Updated Aug 26, 2024 • 563k • 331 • 16
lmms-lab/LLaVA-OneVision-Data • Viewer • Updated Oct 22, 2024 • 3.72M • 9.45k • 160
lmms-lab/LLaVA-NeXT-Data • Viewer • Updated Aug 30, 2024 • 779k • 2.39k • 28
lmms-lab/llavanext-qwen-siglip-tokenizer • Text Generation • Updated Jul 11, 2024 • 277 • 3
lmms-lab/llava-onevision-qwen2-0.5b-si • Text Generation • Updated Sep 2, 2024 • 9.02k • 13
lmms-lab/llava-onevision-qwen2-0.5b-ov • Text Generation • Updated Sep 2, 2024 • 56.8k • 15
lmms-lab/llava-onevision-qwen2-7b-si • Text Generation • Updated Sep 2, 2024 • 13.7k • 12
lmms-lab/llava-onevision-qwen2-7b-ov • Text Generation • Updated Sep 2, 2024 • 237k • 44
lmms-lab/llava-onevision-qwen2-72b-si • Text Generation • Updated Sep 2, 2024 • 342 • 1
lmms-lab/llava-onevision-qwen2-72b-ov-sft • Text Generation • Updated Sep 2, 2024 • 2.79k • 14
lmms-lab/llava-onevision-qwen2-72b-ov-chat • Image-Text-to-Text • Updated Oct 9, 2024 • 578 • 8
lmms-lab/llava-onevision-projectors • Updated Aug 14, 2024 • 3
lmms-lab/llava-onevision-qwen2-0.5b-mid-stage-a4 • Updated Aug 6, 2024 • 131
lmms-lab/llava-onevision-qwen2-7b-mid-stage-a4 • Updated Aug 6, 2024 • 3.34k