Lightweight Models - a milwright Collection

milwright 's Collections

Lightweight Models

Historical Methods

Academic Methods

Lightweight Models

updated 12 days ago

FITS: Modeling Time Series with 10k Parameters

Paper • 2307.03756 • Published Jul 6, 2023
Self-Taught Self-Correction for Small Language Models

Paper • 2503.08681 • Published Mar 11 • 15
DTee8/galactus

5B • Updated Mar 17 • 12 • 1
microsoft/Phi-4-mini-instruct

Text Generation • 4B • Updated May 1 • 206k • 527
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Feb 24 • 1.25M • • 1.25k
sesame/csm-1b

Text-to-Speech • 2B • Updated May 27 • 37.4k • • 2.11k
deepcogito/cogito-v1-preview-llama-3B

Text Generation • 4B • Updated Apr 8 • 2.66k • 95
microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated May 1 • 10.2k • 1.1k
open-r1/Mixture-of-Thoughts

Viewer • Updated May 26 • 699k • 41.4k • 244
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Text Generation • 8B • Updated about 1 month ago • 552k • • 810
mistralai/Devstral-Small-2505

Text2Text Generation • 24B • Updated May 26 • 126k • 798
google/gemma-3-27b-it-qat-q4_0-gguf

Image-Text-to-Text • 27B • Updated Apr 11 • 6.77k • 305
google/gemma-3-1b-it

Text Generation • 1.0B • Updated Apr 4 • 2.22M • 491
google/gemma-3-12b-it

Image-Text-to-Text • 12B • Updated Mar 21 • 384k • • 425
Running on Zero

12

12

Cosmos-Predict2 14B

🦀

Text-to-Image world model with Cosmos2