akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet_14BDrafter 2B • Updated 17 days ago • 8
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet 2B • Updated 17 days ago • 52
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet_32BDrafter Updated 20 days ago
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner_Mini Text Generation • 2B • Updated 22 days ago • 28
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SplitReasoner Text Generation • 2B • Updated Apr 22 • 10
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner Text Generation • 2B • Updated Apr 19 • 1.77k • 1
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner Text Generation • 2B • Updated Apr 17 • 930
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SelfCompress_SFT Text Generation • 2B • Updated Apr 15 • 27
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_GRPO_14k_v3 Text Generation • 2B • Updated Apr 15 • 8
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_14k Text Generation • 2B • Updated Apr 14 • 35
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT Text Generation • 2B • Updated Apr 10 • 26