Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations Paper • 2506.04633 • Published 5 days ago • 18
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning Paper • 2506.04207 • Published 5 days ago • 44
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards Paper • 2505.24760 • Published 10 days ago • 61
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 7 days ago • 139
ARIA: Training Language Agents with Intention-Driven Reward Aggregation Paper • 2506.00539 • Published 10 days ago • 28
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows Paper • 2505.19897 • Published 15 days ago • 101
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published 15 days ago • 64
Scaling Image and Video Generation via Test-Time Evolutionary Search Paper • 2505.17618 • Published 18 days ago • 39
FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow Paper • 2505.17399 • Published 18 days ago • 14
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models Paper • 2505.14810 • Published 20 days ago • 61
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published Dec 23, 2024 • 44
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild Paper • 2503.18892 • Published Mar 24 • 30
Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging Paper • 2505.05464 • Published May 8 • 10
Learn to Reason Efficiently with Adaptive Length-based Reward Shaping Paper • 2505.15612 • Published 19 days ago • 32
Laser Collection The collection for the Paper "Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping" • 13 items • Updated 19 days ago