Video-ChatGPT is a video conversation model that generates meaningful conversations about videos. It combines the capabilities of large language models (LLMs) with a pretrained visual encoder adapted for spatiotemporal video representation.

GitHub: https://github.com/mbzuai-oryx/Video-ChatGPT
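
This card does not spell out how the visual encoder is adapted for spatiotemporal representation. As a rough illustration only, the sketch below assumes that per-frame patch features from an image encoder are average-pooled along the patch axis and along the frame axis, and that the two pooled sets are concatenated into one token sequence. The function names, the token order, and the toy feature values are hypothetical and are not taken from the repository.

```python
from typing import List

Vector = List[float]


def mean_vectors(vectors: List[Vector]) -> Vector:
    """Element-wise mean of a non-empty list of equally sized vectors."""
    count = len(vectors)
    return [sum(values) / count for values in zip(*vectors)]


def spatiotemporal_pool(frame_features: List[List[Vector]]) -> List[Vector]:
    """Pool frame-level patch features into video-level tokens.

    frame_features[t][n] is the D-dimensional feature of patch n in frame t
    (T frames, N patches per frame). Returns T + N vectors of size D.
    Assumption: the concatenation order (per-frame first) is arbitrary here.
    """
    # Average over the patches of each frame: one vector per frame (T x D).
    per_frame = [mean_vectors(patches) for patches in frame_features]
    # Average over the frames at each patch position: one vector per patch (N x D).
    per_patch = [mean_vectors(list(same_patch)) for same_patch in zip(*frame_features)]
    return per_frame + per_patch


# Toy input: T = 2 frames, N = 3 patches, D = 2 feature dimensions.
features = [
    [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]],
    [[7.0, 8.0], [9.0, 10.0], [11.0, 12.0]],
]
tokens = spatiotemporal_pool(features)
print(len(tokens))  # 5 tokens: 2 per-frame + 3 per-patch
```

Under this assumption the token count grows as T + N rather than T × N, which keeps the sequence passed to the language model short even for longer clips.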
