Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published 22 days ago • 74 • 9
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 By tomaarsen and 1 other • 23 days ago • 105
view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • Jun 12 • 115
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.28k
Running 2.84k 2.84k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
view article Article From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other • Feb 11 • 31
view article Article From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning By NormalUhr • Feb 4 • 16
gaochangkuan/whisper-large-v2_FT_model_checkpoints Automatic Speech Recognition • 2B • Updated Sep 29, 2024 • 2
gaochangkuan/whisper-large-v2_FT_model_checkpoints2 Automatic Speech Recognition • 2B • Updated Sep 18, 2024 • 3