akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner Text Generation • 2B • Updated Apr 19 • 1.74k • 1
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner_Mini Text Generation • 2B • Updated 27 days ago • 14
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SplitReasoner Text Generation • 2B • Updated Apr 22 • 5