Video-Text-to-Text
Transformers
Safetensors
English
qwen2
text-generation
text-generation-inference
DragonAura commited on
Commit
b91c40b
·
verified ·
1 Parent(s): c8d970d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -3
README.md CHANGED
@@ -1,3 +1,21 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - HuggingFaceFV/finevideo
5
+ - lmms-lab/LLaVA-Video-178K
6
+ - ShareGPT4Video/ShareGPT4Video
7
+ language:
8
+ - en
9
+ metrics:
10
+ - accuracy
11
+ base_model:
12
+ - Qwen/Qwen2-7B
13
+ - lmms-lab/llava-onevision-qwen2-7b-ov
14
+ - openai/whisper-large-v3
15
+ pipeline_tag: video-text-to-text
16
+ library_name: transformers
17
+ ---
18
+
19
+ # video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models
20
+
21
+ Official model release of [video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models](https://github.com/bytedance/video-SALMONN-2)