metadata

license: apache-2.0
datasets:
  - lmms-lab/LLaVA-Video-178K
  - ShareGPT4Video/ShareGPT4Video
language:
  - en
metrics:
  - accuracy
base_model:
  - Qwen/Qwen2-7B
  - lmms-lab/llava-onevision-qwen2-7b-ov
pipeline_tag: video-text-to-text
library_name: transformers

Improving LLM Video Understanding with 16 Frames Per Second

Official model release of "Improving LLM Video Understanding with 16 Frames Per Second".
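
To make the "16 frames per second" in the title concrete, here is a minimal sketch of what uniform sampling at a fixed rate looks like in terms of source-frame indices. It is an illustration only, not this release's preprocessing code: the function name `sample_frame_indices` and its parameters are hypothetical, and the nearest-frame selection rule is an assumption.

```python
def sample_frame_indices(num_frames: int, native_fps: float,
                         target_fps: float = 16.0) -> list[int]:
    """Pick source-frame indices on a uniform time grid of target_fps.

    Hypothetical helper for illustration; the released model's actual
    frame-selection logic may differ.
    """
    if num_frames <= 0 or native_fps <= 0 or target_fps <= 0:
        raise ValueError("num_frames, native_fps and target_fps must be positive")

    duration_s = num_frames / native_fps               # clip length in seconds
    num_samples = max(1, int(duration_s * target_fps)) # samples on the target grid

    indices = []
    for k in range(num_samples):
        t = k / target_fps                             # timestamp of the k-th sample
        idx = int(round(t * native_fps))               # nearest source frame
        indices.append(min(idx, num_frames - 1))       # clamp to the last frame
    return indices


# A 2-second clip recorded at 32 FPS (64 frames), resampled to 16 FPS:
indices = sample_frame_indices(num_frames=64, native_fps=32.0)
print(len(indices), indices[:4])
```

If the source video is recorded below the target rate, this sketch repeats source frames rather than interpolating new ones.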