Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding • Paper • 2506.16035 • Published 5 days ago • 59 • 7
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents • Paper • 2506.11763 • Published 11 days ago • 55 • 4
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention • Paper • 2506.13585 • Published 8 days ago • 229 • 5
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning • Paper • 2506.10521 • Published 12 days ago • 65 • 4
Emerging Properties in Unified Multimodal Pretraining • Paper • 2505.14683 • Published May 20 • 130 • 4