VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning
Ye Liu
yeliudev
AI & ML interests
Vision & Language
Recent Activity
updated
a dataset
8 days ago
yeliudev/datasets
upvoted
a
paper
8 days ago
D-AR: Diffusion via Autoregressive Models
upvoted
a
paper
10 days ago
Paper2Poster: Towards Multimodal Poster Automation from Scientific
Papers
Organizations
Collections
3
spaces
2
models
7

yeliudev/VideoMind-7B
Video-Text-to-Text
•
Updated
•
31
•
3

yeliudev/VideoMind-2B
Video-Text-to-Text
•
Updated
•
342
•
1

yeliudev/VideoMind-2B-FT-QVHighlights
Video-Text-to-Text
•
Updated
•
17

yeliudev/R2-Tuning
Updated
•
1

yeliudev/CATNet
Updated

yeliudev/UMT
Updated

yeliudev/ConsNet
Updated