Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Lansechen
/
Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v1
like
0
Text Generation
Transformers
Safetensors
chenggong1995/math3to5
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v1
Commit History
End of training
464f855
verified
Lansechen
commited on
May 13
Model save
b5b84e1
verified
Lansechen
commited on
May 13
Training in progress, step 395
311fa3b
verified
Lansechen
commited on
May 13
Training in progress, step 390
da2a5b7
verified
Lansechen
commited on
May 13
Training in progress, step 375
e17b0d8
verified
Lansechen
commited on
May 13
Training in progress, step 360
d525c60
verified
Lansechen
commited on
May 13
Training in progress, step 345
41381f2
verified
Lansechen
commited on
May 13
Training in progress, step 330
a55ffdb
verified
Lansechen
commited on
May 13
Training in progress, step 315
8336369
verified
Lansechen
commited on
May 13
Training in progress, step 300
3b71a1c
verified
Lansechen
commited on
May 12
Training in progress, step 285
ea77b90
verified
Lansechen
commited on
May 12
Training in progress, step 270
3120330
verified
Lansechen
commited on
May 12
Training in progress, step 255
ed0d661
verified
Lansechen
commited on
May 12
Training in progress, step 240
0abee4e
verified
Lansechen
commited on
May 12
Training in progress, step 225
5ca1129
verified
Lansechen
commited on
May 12
Training in progress, step 210
06f37ae
verified
Lansechen
commited on
May 12
Training in progress, step 195
ac1476b
verified
Lansechen
commited on
May 12
Training in progress, step 180
83ef93a
verified
Lansechen
commited on
May 12
Training in progress, step 165
1e8b552
verified
Lansechen
commited on
May 12
Training in progress, step 150
d9b15f5
verified
Lansechen
commited on
May 12
Training in progress, step 135
13dbe06
verified
Lansechen
commited on
May 12
Training in progress, step 120
6d94425
verified
Lansechen
commited on
May 12
Training in progress, step 105
17421e9
verified
Lansechen
commited on
May 12
Training in progress, step 90
bfe2dcb
verified
Lansechen
commited on
May 12
Training in progress, step 75
31ff965
verified
Lansechen
commited on
May 12
Training in progress, step 60
bcea331
verified
Lansechen
commited on
May 12
Training in progress, step 45
7ed6af2
verified
Lansechen
commited on
May 12
Training in progress, step 30
0a29062
verified
Lansechen
commited on
May 12
Training in progress, step 15
31d7b57
verified
Lansechen
commited on
May 12
initial commit
cdae864
verified
Lansechen
commited on
May 12