Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • 3 days ago • 113
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated about 1 month ago • 74
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated about 7 hours ago • 411k • 1.11k
HuggingFaceFW/fineweb-edu-classifier Text Classification • Updated Nov 17, 2024 • 39.1k • 171
Running 2.23k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • Updated 17 days ago • 384k • 628
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B Text Generation • Updated 17 days ago • 672k • 465