Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Tina-Yi
/
R1-Distill-Qwen-1.5B-LIMR-5e-7-lr
like
0
Follow
Tina
55
Question Answering
PEFT
Safetensors
GAIR/LIMR
English
Chinese
reasoning
arxiv:
2504.15777
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
main
R1-Distill-Qwen-1.5B-LIMR-5e-7-lr
Commit History
Update README.md
9a8763d
verified
upup-ashton-wang
commited on
Jul 8
Update README.md
6cbad2e
verified
upup-ashton-wang
commited on
Apr 22
clean up
697e02e
verified
upup-ashton-wang
commited on
Apr 22
clean up
8529af4
verified
upup-ashton-wang
commited on
Apr 22
clean up
de260e3
verified
upup-ashton-wang
commited on
Apr 22
clean up
def7bb7
verified
upup-ashton-wang
commited on
Apr 22
add post-trained ckpts from 50 to 200
f24bf4a
upup-ashton-wang
commited on
Apr 8
initial commit
2fd44cd
verified
upup-ashton-wang
commited on
Apr 8