unsloth/Nemotron-3-Nano-30B-A3B-GGUF Text Generation • 32B • Updated less than a minute ago • 74.7k • 163
Article • Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand • 24 days ago • 62
The Smol Training Playbook 📚 • The secrets to building world-class LLMs • Running on CPU Upgrade • Featured • 2.71k