AI & ML interests
None defined yet.
Recent Activity
View all activity
Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t"
-
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
Paper • 2503.16219 • Published • 51 -
knoveleng/OpenRS-GRPO
Text Generation • 2B • Updated • 88 • 5 -
knoveleng/Open-RS1
Text Generation • 2B • Updated • 1.91k • 4 -
knoveleng/Open-RS2
Text Generation • 2B • Updated • 1.88k • 1
Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t"
-
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
Paper • 2503.16219 • Published • 51 -
knoveleng/OpenRS-GRPO
Text Generation • 2B • Updated • 88 • 5 -
knoveleng/Open-RS1
Text Generation • 2B • Updated • 1.91k • 4 -
knoveleng/Open-RS2
Text Generation • 2B • Updated • 1.88k • 1