Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jdqqjr
/
DeepSeek-R1-Distill-Qwen-1.5B-FactGRPO-2reward-SubLenCheck-SingleBox-0.15E-40_30_150-kl-rebuild
like
0
Safetensors
qwen2
Model card
Files
Files and versions
Community
main
DeepSeek-R1-Distill-Qwen-1.5B-FactGRPO-2reward-SubLenCheck-SingleBox-0.15E-40_30_150-kl-rebuild
Commit History
Upload folder using huggingface_hub
0feded0
verified
jdqqjr
commited on
Mar 26
initial commit
c25fa5f
verified
jdqqjr
commited on
Mar 26