MoT Experimental Reasoning Traces R1 Collection Mixture-of-Thoughts • 6 items • Updated 2 days ago • 1
The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason Paper • 2505.22653 • Published 9 days ago • 64
LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers Paper • 2505.23758 • Published 8 days ago • 23
VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning? Paper • 2505.23359 • Published 8 days ago • 39
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18 • 49
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published 9 days ago • 116
One-RL-to-See-Them-All Collection https://github.com/MiniMax-AI/One-RL-to-See-Them-All • 5 items • Updated 11 days ago • 12
Let LLMs Break Free from Overthinking via Self-Braking Tuning Paper • 2505.14604 • Published 17 days ago • 23
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models Paper • 2505.14810 • Published 17 days ago • 60
Cortex Dual ~ DiMind Collection (direct, reactive, retrieval-based responses), (reasoning, planning, deeper analysis) • 4 items • Updated 13 days ago • 1
Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • 15 days ago • 122