Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_190k_5

Commit History

Training in progress, step 5000

c2eeef2
verified

Seongyun committed on Mar 10

initial commit

af60563
verified

Seongyun committed on Mar 9