L1 Collection L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 2 items • Updated 19 days ago • 4
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 44
Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model • Aug 22, 2023 • 31
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 12 days ago • 77