Self-Taught Self-Correction for Small Language Models Paper • 2503.08681 • Published Mar 11 • 15
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24 • 1.25M • 1.25k
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B Text Generation • 8B • Updated about 1 month ago • 552k • 810