luckeciano/Qwen-2.5-7B-GRPO-Base-96Action_414
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
End of training
bda73c0
verified
luckeciano
committed on
11 days ago
Model save
31d84d7
verified
luckeciano
committed on
11 days ago
Training in progress, step 100
a217e53
verified
luckeciano
committed on
11 days ago
Training in progress, step 90
71d0cfe
verified
luckeciano
committed on
11 days ago
Training in progress, step 80
a648ff8
verified
luckeciano
committed on
11 days ago
Training in progress, step 70
12fe8e2
verified
luckeciano
committed on
11 days ago
Training in progress, step 60
c378b13
verified
luckeciano
committed on
11 days ago
Training in progress, step 50
3639d34
verified
luckeciano
committed on
11 days ago
Training in progress, step 40
abe83a8
verified
luckeciano
committed on
11 days ago
Training in progress, step 30
52fe714
verified
luckeciano
committed on
11 days ago
Training in progress, step 20
66d4ffe
verified
luckeciano
committed on
11 days ago
Training in progress, step 10
c5cfa07
verified
luckeciano
committed on
11 days ago
initial commit
932fea5
verified
luckeciano
committed on
11 days ago