Agents of Change: Self-Evolving LLM Agents for Strategic Planning • Paper • 2506.04651 • Published 17 days ago • 7
SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence • Paper • 2506.15672 • Published 3 days ago • 9
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective • Paper • 2506.14965 • Published 4 days ago • 31
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs • Paper • 2506.14429 • Published 5 days ago • 40
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning • Paper • 2506.10521 • Published 10 days ago • 64
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention • Paper • 2506.13585 • Published 5 days ago • 219
pLSTM: parallelizable Linear Source Transition Mark networks • Paper • 2506.11997 • Published 8 days ago • 8
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents • Paper • 2505.22954 • Published 24 days ago • 11
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models • Paper • 2505.24864 • Published 22 days ago • 127
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning • Paper • 2505.24726 • Published 22 days ago • 197
Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity • Paper • 2506.09250 • Published 11 days ago • 27