Visual Reasoning through Tool-supervised Reinforcement Learning Paper • 2604.19945 • Published 4 days ago • 3
DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data Paper • 2604.19859 • Published 4 days ago • 44
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 3 days ago • 220
Speculative Decoding for Autoregressive Video Generation Paper • 2604.17397 • Published 6 days ago • 9
Multiplication in Multimodal LLMs: Computation with Text, Image, and Audio Inputs Paper • 2604.18203 • Published 5 days ago • 5
Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration Paper • 2604.18131 • Published 5 days ago • 8
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 5 days ago • 41
Learning Adaptive Reasoning Paths for Efficient Visual Reasoning Paper • 2604.14568 • Published 9 days ago • 8
Towards Autonomous Mechanistic Reasoning in Virtual Cells Paper • 2604.11661 • Published 11 days ago • 6