YAQA Collection YAQA hessians (Sketch B) and models with the QTIP quantizer. See https://github.com/Cornell-RelaxML/yaqa/tree/main for more details. β’ 9 items β’ Updated 3 days ago β’ 1
SANA-Sprint Collection πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation β’ 6 items β’ Updated Apr 17 β’ 41
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices β’ 16 items β’ Updated about 12 hours ago β’ 45
GRMR V3 Models Collection An improved set of models for grammar correction. (Chat template should work, no "responding as an LLM" anymore, that kind of stuff). β’ 6 items β’ Updated 5 days ago β’ 9
LaViDa-1.0 Collection LArge VIsion-language Diffusion moDel with mAsking β’ 11 items β’ Updated 14 days ago β’ 7
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models, including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained and instruction-tuned). β’ 37 items β’ Updated 20 days ago β’ 39
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. β’ 4 items β’ Updated 11 days ago β’ 154
πͺ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos β’ 12 items β’ Updated May 5 β’ 227
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. β’ 23 items β’ Updated May 2 β’ 193
GrayLine Collection - Qwen3 Collection Unalignment + Reasoning β’ 8 items β’ Updated 23 days ago β’ 8
Falcon Edge series Collection A series of powerful, universal and fine-tunable small Language Models β’ 7 items β’ Updated 20 days ago β’ 22
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others β’ 26 days ago β’ 112