Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sugatoray
's Collections
LLMs
LLM Tools
AV LLMs
LLM Training Datasets
Papers
Leaderboards 🔥
Papers-MoE
Papers-LLMEval
LLM LLAMA3
Papers-Fundamentals
TFM: TimeSeries Foundation Models
Papers-Benchmarks
LLMs-EmbeddingModels
LLMs + Mamba
LLM + Datasets : Finance
AV LLMs
updated
5 days ago
A collection of Audio, Video and Visual LLMs.
Upvote
2
myshell-ai/OpenVoice
Text-to-Speech
•
Updated
Apr 24
•
384
Running
935
🤗
OpenVoice
dataautogpt3/ProteusV0.3
Text-to-Image
•
Updated
Feb 12
•
40k
•
89
ByteDance/SDXL-Lightning
Text-to-Image
•
Updated
Apr 3
•
84k
•
1.88k
openai/whisper-large-v3
Automatic Speech Recognition
•
Updated
Aug 12
•
4.75M
•
•
3.42k
stabilityai/TripoSR
Image-to-3D
•
Updated
Aug 9
•
24.7k
•
440
Efficient-Large-Model/VILA-7b
Text Generation
•
Updated
Mar 4
•
1.02k
•
25
google/paligemma-3b-pt-896
Image-Text-to-Text
•
Updated
Jul 19
•
65.1k
•
106
microsoft/Phi-3-vision-128k-instruct
Text Generation
•
Updated
Aug 20
•
127k
•
891
stabilityai/stable-audio-open-1.0
Text-to-Audio
•
Updated
Jul 31
•
21k
•
862
OpenVLA: An Open-Source Vision-Language-Action Model
Paper
•
2406.09246
•
Published
Jun 13
•
36
aiola/whisper-medusa-v1
Updated
Aug 3
•
539
•
172
merve/idefics3llama-vqav2
Updated
11 days ago
•
9
black-forest-labs/FLUX.1-schnell
Text-to-Image
•
Updated
Aug 16
•
1.18M
•
•
2.33k
Running
on
Zero
97
😻
Llama3.1 S V0.2 Checkpoint 2024 08 20
gpt-omni/mini-omni
Text-to-Speech
•
Updated
18 days ago
•
4
•
350
fishaudio/fish-speech-1.4
Text-to-Speech
•
Updated
3 days ago
•
4.66k
•
339
Running
on
Zero
123
📲🫴🏻👁
Tonic's GOT OCR
GOT - OCR (from : UCAS, Beijing)
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
5 days ago
•
123k
•
509
apple/coreml-sam2-large
Mask Generation
•
Updated
9 days ago
•
55
•
9
coreml-projects/sam-2-studio
Updated
9 days ago
•
11
mistralai/Pixtral-12B-2409
Updated
5 days ago
•
9
•
281
Upvote
2
Share collection
View history
Collection guide
Browse collections