Tags: Video-Text-to-Text, Transformers, English, qwen2, text-generation

Improving LLM Video Understanding with 16 Frames Per Second

Official model release for "Improving LLM Video Understanding with 16 Frames Per Second".

Model tree for tsinghua-ee/F-16

Base model: Qwen/Qwen2-7B
Finetuned: this model (tsinghua-ee/F-16)

Datasets used to train tsinghua-ee/F-16