Edit Models filters

Inference Providers

HF Inference API

Misc

video-text-to-text

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

342

Full-text search

Active filters: video-text-to-text

OpenGVLab/InternVideo2-Chat-8B

Video-Text-to-Text • 8B • Updated Oct 10, 2024 • 344 • 23

OpenGVLab/InternVideo2_chat_8B_HD

Video-Text-to-Text • 8B • Updated Dec 18, 2024 • 112 • 18

OpenGVLab/videochat2

Video-Text-to-Text • Updated Aug 14, 2024 • 3

OpenGVLab/InternVideo2_Chat_8B_InternLM2_5

Video-Text-to-Text • 9B • Updated Sep 19, 2024 • 59 • 7

francisapzii/bibibigmodel

Video-Text-to-Text • Updated Aug 28, 2024

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 39.4k • 112

LeroyDyer/_Spydaz_Web_AI_LlavaNextVideo

Video-Text-to-Text • 7B • Updated Sep 19, 2024 • 2 • 1

zai-org/cogvlm2-llama3-caption

Video-Text-to-Text • 13B • Updated May 14 • 1.95k • 107

THUdyh/Oryx-34B

Video-Text-to-Text • 35B • Updated Mar 1 • 3

OpenGVLab/VideoChat2_HD_stage4_Mistral_7B_hf

Video-Text-to-Text • 8B • Updated Dec 19, 2024 • 67 • 3

PolyU-ChenLab/ETChat-Phi3-Mini-Stage-1

Video-Text-to-Text • 5B • Updated Oct 29, 2024 • 2 • 1

wchai/AuroraCap-7B-VID-xtuner

Video-Text-to-Text • 7B • Updated Oct 7, 2024 • 43 • 5

kiddobellamy/Llama_Vision

Video-Text-to-Text • Updated Sep 28, 2024 • 3 • 1

Neurazum/Xbai-Epilepsy-1.0

Video-Text-to-Text • Updated Apr 22 • 2

DAMO-NLP-SG/VideoLLaMA2.1-7B-16F

Video-Text-to-Text • 8B • Updated Sep 4 • 6.06k • 10

Vision-CAIR/LongVU_Qwen2_7B

Video-Text-to-Text • 8B • Updated Feb 28 • 134 • 73

Vision-CAIR/LongVU_Llama3_2_3B

Video-Text-to-Text • Updated Feb 28 • 15 • 8

THUdyh/Oryx-1.5-7B

Video-Text-to-Text • 8B • Updated Mar 1 • 3 • 8

Vision-CAIR/LongVU_Llama3_2_1B

Video-Text-to-Text • Updated Feb 28 • 6 • 12

jadechoghari/LongVU_Qwen2_7B

Video-Text-to-Text • 8B • Updated Oct 31, 2024 • 6 • 1

jadechoghari/LongVU_Llama3_2_1B

Video-Text-to-Text • Updated Nov 4, 2024

jadechoghari/LongVU_Llama3_2_3B

Video-Text-to-Text • Updated Nov 4, 2024 • 1

wangyueqian/MMDuet

Video-Text-to-Text • Updated Nov 28, 2024 • 20 • 4

xjtupanda/MiniCPM-V-200K-video-finetune

Video-Text-to-Text • 9B • Updated Jul 24 • 7

xjtupanda/MiniCPM-V-30K-mix-finetune

Video-Text-to-Text • 9B • Updated Jul 24 • 8

TIGER-Lab/VideoScore-v1.1

Video-Text-to-Text • 8B • Updated Feb 13 • 256 • 7

TIGER-Lab/VISTA-LongVA

Video-Text-to-Text • 8B • Updated Mar 14 • 2

TIGER-Lab/VISTA-Mantis

Video-Text-to-Text • 8B • Updated Mar 14 • 1

TIGER-Lab/VISTA-VideoLLaVA

Video-Text-to-Text • 7B • Updated Mar 14 • 1

Inst-IT/LLaVA-Next-Inst-It-Vicuna-7B

Video-Text-to-Text • 7B • Updated Feb 20 • 5 • 2