Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Yukang
/
Qwen2.5-3B-Open-R1-Code-GRPO
like
0
Text Generation
Transformers
Safetensors
open-r1/verifiable-coding-problems-python
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen2.5-3B-Open-R1-Code-GRPO
/
README.md
Commit History
Training in progress, step 502
61386c1
verified
Yukang
commited on
24 days ago
End of training
d79c29c
verified
Yukang
commited on
24 days ago
Model save
0f89c36
verified
Yukang
commited on
24 days ago
Training in progress, step 501
1e19480
verified
Yukang
commited on
24 days ago
End of training
2fa493d
verified
Yukang
commited on
24 days ago
Model save
b7a256f
verified
Yukang
commited on
24 days ago
Training in progress, step 504
c35f5d3
verified
Yukang
commited on
24 days ago
End of training
d499a55
verified
Yukang
commited on
24 days ago
Model save
9fd7951
verified
Yukang
commited on
24 days ago
End of training
2bc7f30
verified
Yukang
commited on
24 days ago
Model save
66e4dbc
verified
Yukang
commited on
24 days ago
Training in progress, step 501
2730fba
verified
Yukang
commited on
24 days ago
End of training
e4287a7
verified
Yukang
commited on
24 days ago
Model save
55c4df7
verified
Yukang
commited on
24 days ago
Training in progress, step 503
cbfdb40
verified
Yukang
commited on
24 days ago
End of training
a3f2251
verified
Yukang
commited on
24 days ago
Model save
02a656c
verified
Yukang
commited on
24 days ago
End of training
ce68975
verified
Yukang
commited on
24 days ago
Model save
5e96efd
verified
Yukang
commited on
24 days ago
End of training
ebb0bc5
verified
Yukang
commited on
24 days ago
Model save
fcb5097
verified
Yukang
commited on
24 days ago
End of training
1fa3e96
verified
Yukang
commited on
24 days ago
Model save
b6653af
verified
Yukang
commited on
24 days ago
End of training
c3c1376
verified
Yukang
commited on
24 days ago
Model save
b4bc3aa
verified
Yukang
commited on
24 days ago
Previous
1
2
3
Next