sergiopaniego/Qwen2-0.5B-GRPO
Tags: Transformers, TensorBoard, Safetensors, AI-MO/NuminaMath-TIR, Generated from Trainer, trl, grpo, arxiv:2402.03300
Branch: main
Path: Qwen2-0.5B-GRPO/runs/Jan31_12-00-34_114542e8f14a
Commit History
Training in progress, step 170 (f9264f2, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 160 (fccc62d, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 150 (6416dd4, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 140 (391fe37, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 130 (79fa76a, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 120 (245ea15, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 110 (cf9dc8b, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 100 (0e04714, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 90 (8fe1d7f, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 80 (a3ce296, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 70 (ddb0c8a, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 60 (b8307b6, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 50 (da92888, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 40 (3fa13ab, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 30 (c327386, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 20 (41f3100, verified), sergiopaniego (HF Staff), committed on Jan 31
Training in progress, step 10 (ca2ae78, verified), sergiopaniego (HF Staff), committed on Jan 31