Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Paper • 2505.22618 • Published 9 days ago • 39
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing Paper • 2505.21600 • Published 10 days ago • 68
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published 9 days ago • 116
Shifting AI Efficiency From Model-Centric to Data-Centric Compression Paper • 2505.19147 • Published 12 days ago • 144
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models Paper • 2505.18536 • Published 14 days ago • 18
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning Paper • 2505.17667 • Published 15 days ago • 85
PhyX: Does Your Model Have the "Wits" for Physical Reasoning? Paper • 2505.15929 • Published 16 days ago • 48
Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models Paper • 2505.17225 • Published 15 days ago • 64
Large Language Models Implicitly Learn to See and Hear Just By Reading Paper • 2505.17091 • Published 17 days ago • 5
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning Paper • 2505.16483 • Published 16 days ago • 10
VeriThinker: Learning to Verify Makes Reasoning Model Efficient Paper • 2505.17941 • Published 14 days ago • 24
Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection Paper • 2505.17558 • Published 15 days ago • 15
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published 15 days ago • 77
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification Paper • 2505.16938 • Published 15 days ago • 115
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models Paper • 2505.14810 • Published 17 days ago • 60
KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models Paper • 2505.16707 • Published 15 days ago • 42