Running 89 Unlocking On-Policy Distillation for Any Model Family 📝 89 Visualize on-policy distillation for any model family
Running 79 Maintain the unmaintainable 📚 79 Explore the complex relationships between 400+ machine learning models
Running 3.73k The Ultra-Scale Playbook 🌌 3.73k The ultimate guide to training LLM on large GPU Clusters