WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale • Paper • 2502.16684 • Published Feb 23
Through the Valley: Path to Effective Long CoT Training for Small Language Models • Paper • 2506.07712 • Published 20 days ago