Spaces:
Running
Running
title: README | |
emoji: π | |
colorFrom: gray | |
colorTo: indigo | |
sdk: static | |
pinned: false | |
π OctoThinker is led by [GAIR](https://huggingface.co/GAIR) | |
π― Our Goal: To reshape the pre-training trajectory so models scale better under RL. | |