deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation β’ Updated 12 days ago β’ 973k β’ β’ 1.14k
Running 1.14k 1.14k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
mistralai/Mistral-Small-24B-Instruct-2501 Text Generation β’ Updated 19 days ago β’ 729k β’ β’ 801