Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Paper • 2506.07044 • Published 3 days ago • 88
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World? Paper • 2506.05287 • Published 5 days ago • 14
Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models Paper • 2505.16854 • Published 20 days ago • 11
Optimizing Anytime Reasoning via Budget Relative Policy Optimization Paper • 2505.13438 • Published 22 days ago • 35
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency Paper • 2504.18589 • Published Apr 24 • 11
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models Paper • 2504.15271 • Published Apr 21 • 65
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy Paper • 2503.19757 • Published Mar 25 • 50
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks Paper • 2503.21696 • Published Mar 27 • 22
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 128
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Paper • 2503.10582 • Published Mar 13 • 23
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding Paper • 2503.02951 • Published Mar 4 • 32
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20 • 145
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization Paper • 2502.13922 • Published Feb 19 • 28
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking Paper • 2502.02339 • Published Feb 4 • 22
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25 • 63
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published Jan 22 • 118