MoIIE: Mixture of Intra- and Inter-Modality Experts for Large Vision Language Models • Paper • 2508.09779 • Published 11 days ago
GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization • Paper • 2506.07160 • Published Jun 8 • 3
Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference • Paper • 2412.12785 • Published Dec 17, 2024
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better • Paper • 2506.09040 • Published Jun 10 • 35