Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mgaimm
/
qwen-2.5-3b-r1-countdown
like
0
Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Metrics
Training metrics
Community
1
Train
Deploy
Use this model
main
qwen-2.5-3b-r1-countdown
/
runs
Commit History
Training in progress, step 450
04862a4
verified
mgaimm
commited on
Feb 1
Training in progress, step 425
03c9f18
verified
mgaimm
commited on
Feb 1
Training in progress, step 400
970acc5
verified
mgaimm
commited on
Feb 1
Training in progress, step 375
365fecb
verified
mgaimm
commited on
Feb 1
Training in progress, step 350
8beaa2c
verified
mgaimm
commited on
Feb 1
Training in progress, step 325
b2b340a
verified
mgaimm
commited on
Feb 1
Training in progress, step 300
8d3335f
verified
mgaimm
commited on
Feb 1
Training in progress, step 275
b662851
verified
mgaimm
commited on
Feb 1
Training in progress, step 250
28bc214
verified
mgaimm
commited on
Feb 1
Training in progress, step 225
907e7a8
verified
mgaimm
commited on
Feb 1
Training in progress, step 200
d576d3b
verified
mgaimm
commited on
Feb 1
Training in progress, step 175
5fa8233
verified
mgaimm
commited on
Feb 1
Training in progress, step 150
18a71d1
verified
mgaimm
commited on
Feb 1
Training in progress, step 125
8b4a762
verified
mgaimm
commited on
Feb 1
Training in progress, step 100
ecb29aa
verified
mgaimm
commited on
Feb 1
Training in progress, step 75
fdd02ad
verified
mgaimm
commited on
Feb 1
Training in progress, step 50
7bfc959
verified
mgaimm
commited on
Feb 1
Training in progress, step 25
ba66d4f
verified
mgaimm
commited on
Feb 1