Running 2.81k 2.81k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text β’ 11B β’ Updated Dec 4, 2024 β’ 362k β’ β’ 1.48k
deepseek-ai/DeepSeek-Coder-V2-Lite-Base Text Generation β’ 16B β’ Updated Jul 3, 2024 β’ 5.63k β’ 86