GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Paper • 2506.03143 • Published 7 days ago • 44
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis Paper • 2506.02096 • Published 9 days ago • 51
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning Paper • 2505.23504 • Published 13 days ago • 6
Taming LLMs by Scaling Learning Rates with Gradient Grouping Paper • 2506.01049 • Published 10 days ago • 36
Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper • 2505.21189 • Published 15 days ago • 60
One RL to See Them All: Visual Triple Unified Reinforcement Learning Paper • 2505.18129 • Published 19 days ago • 59
Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models Paper • 2505.17225 • Published 19 days ago • 64
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning Paper • 2505.17667 • Published 19 days ago • 87
Shifting AI Efficiency From Model-Centric to Data-Centric Compression Paper • 2505.19147 • Published 17 days ago • 145
Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start Paper • 2505.22334 • Published 14 days ago • 36
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Paper • 2505.09343 • Published 28 days ago • 64
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6 • 170