InternVideo2
updated
InternVideo2: Scaling Video Foundation Models for Multimodal Video
Understanding
Paper
• 2403.15377
• Published
• 28
OpenGVLab/InternVideo2-Chat-8B
Video-Text-to-Text
• 8B • Updated
• 219
• 24
OpenGVLab/InternVideo2_chat_8B_HD
Video-Text-to-Text
• 8B • Updated
• 142
• 18
OpenGVLab/InternVideo2_Chat_8B_InternLM2_5
Video-Text-to-Text
• 9B • Updated
• 27
• 7
OpenGVLab/InternVideo2_distillation_models
Updated
• 5
• 1
OpenGVLab/InternVideo2-Stage2_1B-224p-f4
Updated
• 20
OpenGVLab/InternVideo2-Stage1-1B-224p-f8
OpenGVLab/InternVideo2-Stage1-1B-224p-f8-k710
OpenGVLab/InternVideo2-CLIP-1B-224p-f8
OpenGVLab/InternVideo2-Stage1-1B-224p-K400
Video Classification
• Updated
• 4
OpenGVLab/InternVideo2-Stage1-1B-224p-K600
OpenGVLab/InternVideo2-Stage1-1B-224p-K700
OpenGVLab/InternVideo2-Stage1-1B-224p-f8-SthSth
Updated
OpenGVLab/InternVideo2-Stage1-1B-224p-f8-MiT
OpenGVLab/InternVideo2_Vid_Text
Viewer
• Updated
• 40.5M • 20
• 13
OpenGVLab/InternVideo2_5_Chat_8B
Video-Text-to-Text
• 8B • Updated
• 4.12k
• 88
OpenGVLab/InternVideo2-Stage2_6B
Video Classification
• 6B • Updated
• 2.25k
• 1
OpenGVLab/InternVideo2-Stage2_6B-224p-f4
OpenGVLab/InternVideo2-CLIP-6B-224p-f8
OpenGVLab/InternVideo2_CLIP_S
0.4B • Updated
• 158
• 1
OpenGVLab/InternVideo2_chat_8B_HD_F16
Video-Text-to-Text
• Updated
• 3
• 2