Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers Paper • 2506.23918 • Published 14 days ago • 76
WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published 11 days ago • 90
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published 12 days ago • 72
BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing Paper • 2506.17450 • Published 23 days ago • 61
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory Paper • 2507.01945 • Published 11 days ago • 72
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published 13 days ago • 64
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published 12 days ago • 179
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning Paper • 2506.18841 • Published 20 days ago • 56