luckeciano/Qwen-2.5-7B-GRPO-Base-96Action_320
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
2699724 End of training (verified, luckeciano committed 11 days ago)
d9164cc Model save (verified, luckeciano committed 11 days ago)
de0bf27 Training in progress, step 100 (verified, luckeciano committed 11 days ago)
4067f8d Training in progress, step 90 (verified, luckeciano committed 11 days ago)
4dca895 Training in progress, step 80 (verified, luckeciano committed 11 days ago)
5b62721 Training in progress, step 70 (verified, luckeciano committed 11 days ago)
a9ab632 Training in progress, step 60 (verified, luckeciano committed 11 days ago)
371e744 Training in progress, step 50 (verified, luckeciano committed 11 days ago)
b703d5e Training in progress, step 40 (verified, luckeciano committed 11 days ago)
db841a0 Training in progress, step 30 (verified, luckeciano committed 11 days ago)
6aae059 Training in progress, step 20 (verified, luckeciano committed 11 days ago)
b6069e1 Training in progress, step 10 (verified, luckeciano committed 11 days ago)
7828516 initial commit (verified, luckeciano committed 11 days ago)