Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
luckeciano
/
Qwen-2.5-7B-GRPO-Base-1Action_501
like
0
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen-2.5-7B-GRPO-Base-1Action_501
Commit History
End of training
5083439
verified
luckeciano
commited on
13 days ago
Model save
416ce93
verified
luckeciano
commited on
13 days ago
Training in progress, step 100
637b2a8
verified
luckeciano
commited on
13 days ago
Training in progress, step 90
67d1e6f
verified
luckeciano
commited on
13 days ago
Training in progress, step 80
d76639b
verified
luckeciano
commited on
13 days ago
Training in progress, step 70
474cb27
verified
luckeciano
commited on
13 days ago
Training in progress, step 60
5cca24b
verified
luckeciano
commited on
13 days ago
Training in progress, step 50
91b8ac4
verified
luckeciano
commited on
13 days ago
Training in progress, step 40
1c58381
verified
luckeciano
commited on
13 days ago
Training in progress, step 30
4c6cdb5
verified
luckeciano
commited on
13 days ago
Training in progress, step 20
11d21bb
verified
luckeciano
commited on
13 days ago
Training in progress, step 10
356f5f3
verified
luckeciano
commited on
13 days ago
initial commit
a618ff9
verified
luckeciano
commited on
13 days ago