Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Muennighoff
/
Qwen2.5-1.5B-hl-baseline-v8
like
0
Text Generation
Transformers
Safetensors
simplescaling/openaimath
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen2.5-1.5B-hl-baseline-v8
Commit History
End of training
42d48b4
verified
Muennighoff
commited on
about 15 hours ago
Model save
b3f375c
verified
Muennighoff
commited on
about 15 hours ago
Training in progress, step 1440
19cb4ce
verified
Muennighoff
commited on
about 15 hours ago
Training in progress, step 1280
4f5194a
verified
Muennighoff
commited on
about 18 hours ago
Training in progress, step 1120
925f902
verified
Muennighoff
commited on
about 21 hours ago
Training in progress, step 960
c151dde
verified
Muennighoff
commited on
about 24 hours ago
Training in progress, step 800
4e7395d
verified
Muennighoff
commited on
1 day ago
Training in progress, step 640
a82a580
verified
Muennighoff
commited on
1 day ago
Training in progress, step 480
f46fc8e
verified
Muennighoff
commited on
1 day ago
Training in progress, step 320
d67b594
verified
Muennighoff
commited on
1 day ago
Training in progress, step 160
829da58
verified
Muennighoff
commited on
1 day ago
initial commit
908e7bc
verified
Muennighoff
commited on
2 days ago