Progressive Multimodal Reasoning via Active Retrieval Paper • 2412.14835 • Published 30 days ago • 73
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Paper • 2412.14475 • Published about 1 month ago • 53
view article Article Key Insights into the Law of Vision Representations in MLLMs By Borise • Sep 2, 2024 • 17
VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval Paper • 2406.04292 • Published Jun 6, 2024 • 1
MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding Paper • 2406.04264 • Published Jun 6, 2024 • 1