Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding • Paper • 2506.16035 • Published 6 days ago • 71 • 7
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents • Paper • 2506.11763 • Published 12 days ago • 56 • 4
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention • Paper • 2506.13585 • Published 9 days ago • 234 • 5
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning • Paper • 2506.10521 • Published 13 days ago • 65 • 4
Emerging Properties in Unified Multimodal Pretraining • Paper • 2505.14683 • Published May 20 • 130 • 4