Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated 24 days ago • 59
Apriel Collection ServiceNow Language Modeling Lab's first model family series • 2 items • Updated 4 days ago • 7
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated 3 days ago • 49
OpenCodeReasoning Collection Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding • 5 items • Updated 3 days ago • 6
Granite 3.3 Language Models Collection Our latest language models licensed under Apache 2.0 license. • 4 items • Updated 1 day ago • 24
BitNet Collection 🔥BitNet family of large language models (1-bit LLMs). • 6 items • Updated 35 minutes ago • 19
GenPRM Collection A collection of GenPRM. Project page: https://ryanliu112.github.io/GenPRM • 6 items • Updated 12 days ago • 5
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement Paper • 2504.07934 • Published 7 days ago • 14
ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models Paper • 2503.22048 • Published 21 days ago • 2
Synthia-S1 REASONING MODEL Collection Creative, Scientific, and Coding • 3 items • Updated 15 days ago • 3
UIGEN-T1.5 REASONING MODEL Collection UIGEN'S Next Iteration. UIGEN-T1.5 is a midway model between 1 and 2, reflecting our new data collection pipeline changes. • 5 items • Updated 25 days ago • 6
Cohere Labs Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 4 items • Updated 2 days ago • 38