SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 9 days ago • 158
VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks Paper • 2504.05118 • Published 9 days ago • 24
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 16 days ago • 241
Expanding RL with Verifiable Rewards Across Diverse Domains Paper • 2503.23829 • Published 16 days ago • 18