Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Elliott
's Collections
LUFFY-RL
LUFFY-RL
updated
1 day ago
Upvote
3
Elliott/LUFFY-Qwen-Math-7B-Zero
Text Generation
•
Updated
about 5 hours ago
•
16
•
1
Elliott/Qwen2.5-Math-7B-16k-think
Updated
1 day ago
•
37
Elliott/Openr1-Math-46k-8192
Viewer
•
Updated
about 5 hours ago
•
45.8k
•
44
Learning to Reason under Off-Policy Guidance
Paper
•
2504.14945
•
Published
2 days ago
•
61
Upvote
3
Share collection
View history
Collection guide
Browse collections