Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques By jmamou and 8 others • about 24 hours ago • 9
Open R1: How to use OlympicCoder locally for coding? By burtenshaw and 4 others • 6 days ago • 45
Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 822
Universal Assisted Generation: Faster Decoding with Any Assistant Model By danielkorat and 7 others • Oct 29, 2024 • 52
Faster Assisted Generation with Dynamic Speculation By jmamou and 6 others • Oct 8, 2024 • 46
Llama can now see and run on your device - welcome Llama 3.2 By merve and 6 others • Sep 25, 2024 • 185
How NuminaMath Won the 1st AIMO Progress Prize By yfleureau and 7 others • Jul 11, 2024 • 118
Welcome Gemma 2 - Google's new open LLM By philschmid and 5 others • Jun 27, 2024 • 129
Preference Tuning LLMs with Direct Preference Optimization Methods By kashif and 4 others • Jan 18, 2024 • 50
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face By lewtun and 6 others • Dec 11, 2023 • 12
SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit By ronenlap and 5 others • Dec 6, 2023 • 8
Fine-tuning Llama 2 70B using PyTorch FSDP By smangrul and 3 others • Sep 13, 2023 • 22
Code Llama: Llama 2 learns to code By philschmid and 7 others • Aug 25, 2023 • 9
Llama 2 is here - get it on Hugging Face By osanseviero and 3 others • Jul 18, 2023 • 25