view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen • Mar 26 • 135
view article Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other • Feb 25 • 164
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google By ariG23498 and 2 others • Feb 19 • 70
view article Article How to generate text: using different decoding methods for language generation with Transformers By patrickvonplaten • Mar 1, 2020 • 214
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub By jsulz and 3 others • Feb 12 • 66
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.26k
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints By sergeipetrov and 3 others • May 1, 2024 • 77
view article Article Visualize and understand GPU memory in PyTorch By qgallouedec • Dec 24, 2024 • 225
view article Article Deploying Speech-to-Speech on Hugging Face By andito and 3 others • Oct 22, 2024 • 39
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 867
view article Article How to deploy and fine-tune DeepSeek models on AWS By pagezyhf and 2 others • Jan 30 • 52
view article Article Welcome to Inference Providers on the Hub 🔥 By julien-c and 6 others • Jan 28 • 483
view article Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen • Jan 15 • 187
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 139
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 673
FlashSpeech: Efficient Zero-Shot Speech Synthesis Paper • 2404.14700 • Published Apr 23, 2024 • 33