metadata
license: apache-2.0
datasets:
- lmms-lab/LLaVA-Video-178K
- ShareGPT4Video/ShareGPT4Video
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen2-7B
- lmms-lab/llava-onevision-qwen2-7b-ov
pipeline_tag: video-text-to-text
library_name: transformers
Improving LLM Video Understanding with 16 Frames Per Second
Official model release of Improving LLM Video Understanding with 16 Frames Per Second