luckeciano/Qwen-2.5-7B-GRPO-Base-4Action_119
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
d12d5d5 | End of training | luckeciano (verified) | committed 13 days ago
bfa6677 | Model save | luckeciano (verified) | committed 13 days ago
1500cd8 | Training in progress, step 100 | luckeciano (verified) | committed 13 days ago
42e07d2 | Training in progress, step 90 | luckeciano (verified) | committed 13 days ago
0d6b2ec | Training in progress, step 80 | luckeciano (verified) | committed 13 days ago
a685263 | Training in progress, step 70 | luckeciano (verified) | committed 13 days ago
fad9957 | Training in progress, step 60 | luckeciano (verified) | committed 13 days ago
a70898c | Training in progress, step 50 | luckeciano (verified) | committed 13 days ago
4e0ddd2 | Training in progress, step 40 | luckeciano (verified) | committed 13 days ago
9a6ac98 | Training in progress, step 30 | luckeciano (verified) | committed 13 days ago
3bbfcca | Training in progress, step 20 | luckeciano (verified) | committed 13 days ago
7e78ecc | Training in progress, step 10 | luckeciano (verified) | committed 14 days ago
12fe24e | initial commit | luckeciano (verified) | committed 14 days ago