Hugging Face
Models
342
Sort: Trending
Active filters:
video-text-to-text
OpenGVLab/InternVideo2-Chat-8B • Video-Text-to-Text • 8B • Updated Oct 10, 2024 • 344 • 23
OpenGVLab/InternVideo2_chat_8B_HD • Video-Text-to-Text • 8B • Updated Dec 18, 2024 • 112 • 18
OpenGVLab/videochat2 • Video-Text-to-Text • Updated Aug 14, 2024 • 3
OpenGVLab/InternVideo2_Chat_8B_InternLM2_5 • Video-Text-to-Text • 9B • Updated Sep 19, 2024 • 59 • 7
francisapzii/bibibigmodel • Video-Text-to-Text • Updated Aug 28, 2024
lmms-lab/LLaVA-Video-7B-Qwen2 • Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 39.4k • 112
LeroyDyer/_Spydaz_Web_AI_LlavaNextVideo • Video-Text-to-Text • 7B • Updated Sep 19, 2024 • 2 • 1
zai-org/cogvlm2-llama3-caption • Video-Text-to-Text • 13B • Updated May 14 • 1.95k • 107
THUdyh/Oryx-34B • Video-Text-to-Text • 35B • Updated Mar 1 • 3
OpenGVLab/VideoChat2_HD_stage4_Mistral_7B_hf • Video-Text-to-Text • 8B • Updated Dec 19, 2024 • 67 • 3
PolyU-ChenLab/ETChat-Phi3-Mini-Stage-1 • Video-Text-to-Text • 5B • Updated Oct 29, 2024 • 2 • 1
wchai/AuroraCap-7B-VID-xtuner • Video-Text-to-Text • 7B • Updated Oct 7, 2024 • 43 • 5
kiddobellamy/Llama_Vision • Video-Text-to-Text • Updated Sep 28, 2024 • 3 • 1
Neurazum/Xbai-Epilepsy-1.0 • Video-Text-to-Text • Updated Apr 22 • 2
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F • Video-Text-to-Text • 8B • Updated Sep 4 • 6.06k • 10
Vision-CAIR/LongVU_Qwen2_7B • Video-Text-to-Text • 8B • Updated Feb 28 • 134 • 73
Vision-CAIR/LongVU_Llama3_2_3B • Video-Text-to-Text • Updated Feb 28 • 15 • 8
THUdyh/Oryx-1.5-7B • Video-Text-to-Text • 8B • Updated Mar 1 • 3 • 8
Vision-CAIR/LongVU_Llama3_2_1B • Video-Text-to-Text • Updated Feb 28 • 6 • 12
jadechoghari/LongVU_Qwen2_7B • Video-Text-to-Text • 8B • Updated Oct 31, 2024 • 6 • 1
jadechoghari/LongVU_Llama3_2_1B • Video-Text-to-Text • Updated Nov 4, 2024
jadechoghari/LongVU_Llama3_2_3B • Video-Text-to-Text • Updated Nov 4, 2024 • 1
wangyueqian/MMDuet • Video-Text-to-Text • Updated Nov 28, 2024 • 20 • 4
xjtupanda/MiniCPM-V-200K-video-finetune • Video-Text-to-Text • 9B • Updated Jul 24 • 7
xjtupanda/MiniCPM-V-30K-mix-finetune • Video-Text-to-Text • 9B • Updated Jul 24 • 8
TIGER-Lab/VideoScore-v1.1 • Video-Text-to-Text • 8B • Updated Feb 13 • 256 • 7
TIGER-Lab/VISTA-LongVA • Video-Text-to-Text • 8B • Updated Mar 14 • 2
TIGER-Lab/VISTA-Mantis • Video-Text-to-Text • 8B • Updated Mar 14 • 1
TIGER-Lab/VISTA-VideoLLaVA • Video-Text-to-Text • 7B • Updated Mar 14 • 1
Inst-IT/LLaVA-Next-Inst-It-Vicuna-7B • Video-Text-to-Text • 7B • Updated Feb 20 • 5 • 2