From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models Paper • 2401.02777 • Published Jan 5, 2024 • 1
1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training Paper • 2503.19633 • Published Mar 25
How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study Paper • 2504.00829 • Published Apr 1
Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability Paper • 2504.09639 • Published Apr 13
AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale Paper • 2505.08311 • Published 23 days ago • 16
Not All Correct Answers Are Equal: Why Your Distillation Source Matters Paper • 2505.14464 • Published 16 days ago • 8