view article Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien and 5 others β’ Dec 23, 2024 β’ 20
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B Text Generation β’ Updated about 7 hours ago β’ 1.24k β’ 101
PsycheFoundation/consilience-40b-7Y9v38s5 Text Generation β’ Updated about 15 hours ago β’ 662 β’ 17
π©βπ» OlympicCoder Collection Reasoning datasets and models for competitive coding β’ 4 items β’ Updated 23 days ago β’ 17
π§ Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community β’ 24 items β’ Updated 17 days ago β’ 148
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana β’ 10 days ago β’ 40
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others β’ 22 days ago β’ 110
INTELLECT-2 Collection INTELLECT-2 is a 32 billion parameter language model with globally distributed reinforcement learning. β’ 3 items β’ Updated 25 days ago β’ 22
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers Paper β’ 2505.04842 β’ Published 29 days ago β’ 12 β’ 3
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers Paper β’ 2505.04842 β’ Published 29 days ago β’ 12