Bridging Supervised Learning and Reinforcement Learning in Math Reasoning • Paper • 2505.18116 • Published 17 days ago
OpenR1-Math • Collection • Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 3 items • Updated 28 days ago