Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Paper • 2505.15966 • Published 21 days ago • 51
StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs Paper • 2505.20139 • Published 17 days ago • 18
Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL Paper • 2505.17952 • Published 20 days ago • 20
Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models Paper • 2505.19509 • Published 17 days ago • 2
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing Paper • 2506.03197 • Published 11 days ago • 4
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning Paper • 2505.24850 • Published 13 days ago • 9