LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-Step Reasoning Paper • 2509.24786 • Published Sep 29 • 5
RecA Collection Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning! • 8 items • Updated Sep 22 • 12
EgoExo-Fitness Collection EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding • 2 items • Updated Jul 26
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness Paper • 2503.21755 • Published Mar 27 • 33
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition Paper • 2412.09501 • Published Dec 12, 2024 • 48
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding Paper • 2406.08877 • Published Jun 13, 2024