Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DatPySci
/
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
like
0
Safetensors
qwen2
Model card
Files
Files and versions
Community
main
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Commit History
Training in progress, step 175
e2e0239
verified
DatPySci
commited on
Apr 28
Training in progress, step 150
b5988e3
verified
DatPySci
commited on
Apr 28
Training in progress, step 125
3c8d9fe
verified
DatPySci
commited on
Apr 28
Training in progress, step 100
8872514
verified
DatPySci
commited on
Apr 27
Training in progress, step 75
3641c75
verified
DatPySci
commited on
Apr 27
Training in progress, step 50
734cb51
verified
DatPySci
commited on
Apr 27
Training in progress, step 25
40c7923
verified
DatPySci
commited on
Apr 27
Training in progress, step 50
c221637
verified
DatPySci
commited on
Apr 26
initial commit
f4e0631
verified
DatPySci
commited on
Apr 22