The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters • Running • 1.14k
Article: Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 • 4 days ago • 86
Article: Mastering Long Contexts in LLMs with KVPress • By nvidia and 1 other • 29 days ago • 63
Article: Yay! Organizations can now publish blog Articles • By huggingface and 3 others • Jan 20 • 34
Collection: Jan 17 Releases • Models and datasets of the second week of Jan 2025. • 23 items • Updated Jan 17 • 11
Article: Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference • Jan 16 • 68