Running 2.73k 2.73k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
microsoft/Phi-4-multimodal-instruct-onnx Automatic Speech Recognition • Updated 17 days ago • 135 • 73
Menlo/ReZero-v0.1-llama-3.2-3b-it-grpo-250404 Text Generation • 3B • Updated Apr 17 • 1.79k • 62