Granite 3.3 Language Models Collection Our latest language models licensed under Apache 2.0 license. • 4 items • Updated 3 days ago • 27
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published 12 days ago • 112
Kimina Prover Preview Collection State-of-the-Art Models for Formal Mathematical Reasoning • 4 items • Updated 5 days ago • 26
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 9 days ago • 76
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 7 days ago • 61
Minueza-2-96M Collection The second version of the Minueza series. Base model and its fine-tunings. • 11 items • Updated 1 day ago • 1
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 1 day ago • 137
Tessa-T1 REACT REASONING MODEL Collection Tessa-T1 is a model that generates Stateful React with tailwind styling. It has features of other libraries as well. It is based on Qwen2.5-Coder. • 5 items • Updated 26 days ago • 6
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 30 days ago • 46
Hamanasu Collection A brand new series of Models from yours truly, Designed for Intelligence, Creativity and Roleplay - R/Locallama keeps DELETING MY GODDAMN COMMENTS • 31 items • Updated 13 days ago • 8
Llama Nemotron Collection Open, Production-ready Enterprise Models • 4 items • Updated 5 days ago • 36
EXAONE-Deep Collection EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated Mar 18 • 86
DeepHermes Collection Preview models of hybrid reasoner Hermes series • 6 items • Updated Mar 13 • 27