SamuraiBarbi's Collections
Machine Vision Models
Updated May 13
Lin-Chen/ShareGPT4V-7B
Image-Text-to-Text
•
Updated
Jun 6, 2024
•
9.45k
•
82
Lin-Chen/ShareCaptioner
Feature Extraction
•
Updated
Jun 6, 2024
•
187
•
56
Lin-Chen/ShareGPT4V-13B
Text Generation
•
Updated
Jun 6, 2024
•
7
•
33
deepseek-ai/deepseek-vl-1.3b-base
2B
•
Updated
Mar 15, 2024
•
1.84k
•
55
deepseek-ai/deepseek-vl-1.3b-chat
Image-Text-to-Text
•
2B
•
Updated
Mar 15, 2024
•
94.5k
•
68
deepseek-ai/deepseek-vl-7b-base
7B
•
Updated
Mar 15, 2024
•
23.7k
•
63
deepseek-ai/deepseek-vl-7b-chat
Image-Text-to-Text
•
7B
•
Updated
Mar 15, 2024
•
54.2k
•
261
xtuner/llava-llama-3-8b
Image-Text-to-Text
•
8B
•
Updated
Apr 26, 2024
•
63
•
37
Qwen/Qwen-VL
Text Generation
•
Updated
Jan 25, 2024
•
54.4k
•
254
xtuner/llava-llama-3-8b-v1_1
Image-Text-to-Text
•
8B
•
Updated
Apr 28, 2024
•
1.1k
•
120
internlm/internlm-xcomposer-7b
Text Generation
•
Updated
Dec 25, 2023
•
1.33k
•
21
liuhaotian/llava-v1.5-13b
Image-Text-to-Text
•
Updated
May 9, 2024
•
71k
•
510
liuhaotian/llava-v1.5-7b
Image-Text-to-Text
•
Updated
May 8, 2024
•
1.53M
•
487
liuhaotian/llava-v1.6-34b
Image-Text-to-Text
•
35B
•
Updated
May 9, 2024
•
19.8k
•
353
liuhaotian/llava-v1.6-mistral-7b
Image-Text-to-Text
•
8B
•
Updated
May 8, 2024
•
55.8k
•
239
liuhaotian/llava-v1.6-vicuna-13b
Image-Text-to-Text
•
13B
•
Updated
May 9, 2024
•
8k
•
58
liuhaotian/llava-v1.6-vicuna-7b
Image-Text-to-Text
•
7B
•
Updated
May 9, 2024
•
19.4k
•
131
yhcao/DualFocus-LLaVA-1.5-7B
Text Generation
•
Updated
Feb 22, 2024
•
4
•
2
yhcao/DualFocus-LLaVA-1.5-13B
Text Generation
•
Updated
Feb 22, 2024
•
1
•
2
yhcao/DualFocus-ShareGPT4V-13B
Text Generation
•
Updated
Feb 22, 2024
•
4
•
4
internlm/internlm-xcomposer-vl-7b
Text Generation
•
Updated
Dec 25, 2023
•
51
•
20
internlm/internlm-xcomposer2-vl-7b
Visual Question Answering
•
Updated
Apr 12, 2024
•
1.61k
•
82
internlm/internlm-xcomposer2-vl-1_8b
Visual Question Answering
•
Updated
Apr 9, 2024
•
81
•
18
internlm/internlm-xcomposer2-4khd-7b
Visual Question Answering
•
Updated
Apr 18, 2024
•
2.39k
•
73
Qwen/Qwen-VL-Chat
Text Generation
•
Updated
Jan 25, 2024
•
78.8k
•
369
MrDragonFox/apple-ferret-13b-merged
Text Generation
•
Updated
Jan 2, 2024
•
6
•
3
MrDragonFox/apple-ferret-7b-merged
Text Generation
•
Updated
Jan 2, 2024
•
16
•
6
vikhyatk/moondream2
Image-Text-to-Text
•
2B
•
Updated
Jul 7
•
147k
•
1.25k
vikhyatk/moondream1
Text Generation
•
2B
•
Updated
Feb 7, 2024
•
30.5k
•
485
zai-org/cogvlm2-llama3-chat-19B
Text Generation
•
20B
•
Updated
Sep 3, 2024
•
4.09k
•
214
microsoft/Florence-2-large
Image-Text-to-Text
•
0.8B
•
Updated
19 days ago
•
982k
•
1.64k
microsoft/Florence-2-base
Image-Text-to-Text
•
0.2B
•
Updated
19 days ago
•
595k
•
290
microsoft/Florence-2-large-ft
Image-Text-to-Text
•
0.8B
•
Updated
19 days ago
•
42.9k
•
364
microsoft/Florence-2-base-ft
Image-Text-to-Text
•
0.2B
•
Updated
19 days ago
•
29.6k
•
127
nakodanei/ShareGPT4V-7B_GGUF
7B
•
Updated
Nov 24, 2023
•
43
•
12
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
8B
•
Updated
Apr 4
•
19.7k
•
543
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
11B
•
Updated
Sep 27, 2024
•
31.7k
•
544
SkunkworksAI/BakLLaVA-1
Text Generation
•
Updated
Oct 23, 2023
•
176
•
378
fancyfeast/llama-joycaption-alpha-two-hf-llava
8B
•
Updated
Nov 29, 2024
•
32.4k
•
188
Running on Zero
•
1.45k
Joy Caption Alpha Two
👁
Generate captions for images in various styles
Running on Zero
•
1.31k
Joy Caption Pre Alpha
💬
Generate captions for images
Qwen/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
4B
•
Updated
Apr 6
•
3.67M
•
492
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Apr 6
•
4.69M
•
1.16k
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
73B
•
Updated
Jun 6
•
864k
•
530
mradermacher/Qwen2.5-VL-7B-Instruct-GGUF
8B
•
Updated
11 days ago
•
4.4k
•
2
mradermacher/Qwen2.5-VL-32B-Instruct-GGUF
33B
•
Updated
23 days ago
•
230
•
2
google/gemma-3-12b-it
Image-Text-to-Text
•
12B
•
Updated
Mar 21
•
389k
•
496
unsloth/gemma-3-27b-it-GGUF
Image-Text-to-Text
•
27B
•
Updated
8 days ago
•
54.2k
•
150
unsloth/gemma-3-4b-it-GGUF
Image-Text-to-Text
•
4B
•
Updated
8 days ago
•
43.1k
•
128
unsloth/gemma-3-12b-it-GGUF
Image-Text-to-Text
•
12B
•
Updated
8 days ago
•
50.2k
•
96
unsloth/gemma-3-1b-it-GGUF
Text Generation
•
1.0B
•
Updated
May 9
•
50k
•
51
IVGSZ/Flash-VStream-7b
Text Generation
•
Updated
Jun 26, 2024
•
116
•
16
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Feb 6
•
549k
•
1.22k
chenjoya/videollm-online-8b-v1plus
Video-Text-to-Text
•
Updated
Jul 13, 2024
•
1.47k
•
27
chenjoya/LiveCC-7B-Instruct
8B
•
Updated
Apr 25
•
3.04k
•
38
chenjoya/LiveCC-7B-Base
8B
•
Updated
Apr 25
•
1.42k
•
6
chenjoya/Qwen2-VL-7B-LiveCCInstruct
8B
•
Updated
Apr 14
•
8
•
1
chenjoya/Qwen2-VL-7B-LLaVAInstruct
8B
•
Updated
Apr 16
•
5
•
1
facebook/Perception-LM-8B
Image-Text-to-Text
•
10B
•
Updated
Jul 14
•
1.09k
•
52
ggml-org/SmolVLM-500M-Instruct-GGUF
0.4B
•
Updated
Apr 30
•
33.3k
•
31
ggml-org/InternVL3-14B-Instruct-GGUF
15B
•
Updated
May 10
•
687
•
3
ggml-org/InternVL3-8B-Instruct-GGUF
8B
•
Updated
May 10
•
863
•
5
ggml-org/InternVL2_5-1B-GGUF
0.6B
•
Updated
May 10
•
126
•
2
ggml-org/InternVL2_5-4B-GGUF
3B
•
Updated
May 10
•
134
•
1
ggml-org/InternVL3-1B-Instruct-GGUF
0.6B
•
Updated
May 10
•
736
•
3
ggml-org/InternVL3-2B-Instruct-GGUF
2B
•
Updated
May 10
•
708
•
5
ggml-org/Qwen2.5-VL-3B-Instruct-GGUF
3B
•
Updated
Apr 30
•
4.68k
•
5
ggml-org/pixtral-12b-GGUF
12B
•
Updated
Apr 30
•
609
•
4
ggml-org/Qwen2-VL-2B-Instruct-GGUF
2B
•
Updated
Apr 30
•
715
•
2
ggml-org/Qwen2.5-VL-7B-Instruct-GGUF
8B
•
Updated
Apr 30
•
2.86k
•
7
ggml-org/Qwen2.5-VL-32B-Instruct-GGUF
33B
•
Updated
May 15
•
747
•
3
ggml-org/SmolVLM-Instruct-GGUF
2B
•
Updated
Apr 30
•
613
•
6
ggml-org/SmolVLM-256M-Instruct-GGUF
0.2B
•
Updated
Apr 30
•
22.6k
•
6
ggml-org/SmolVLM2-2.2B-Instruct-GGUF
2B
•
Updated
Apr 30
•
6.17k
•
18
ggml-org/SmolVLM2-256M-Video-Instruct-GGUF
0.2B
•
Updated
Apr 30
•
613
•
6
ggml-org/SmolVLM2-500M-Video-Instruct-GGUF
0.4B
•
Updated
Apr 30
•
2.14k
•
11