view article Article The NLP Course is becoming the LLM Course! By burtenshaw and 9 others β’ 11 days ago β’ 67
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques By jmamou and 8 others β’ 20 days ago β’ 17
view article Article Open R1: How to use OlympicCoder locally for coding? By burtenshaw and 4 others β’ 25 days ago β’ 57
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 839
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model By danielkorat and 7 others β’ Oct 29, 2024 β’ 55
view article Article Faster Assisted Generation with Dynamic Speculation By jmamou and 6 others β’ Oct 8, 2024 β’ 46
view article Article Llama can now see and run on your device - welcome Llama 3.2 By merve and 6 others β’ Sep 25, 2024 β’ 188
view article Article How NuminaMath Won the 1st AIMO Progress Prize By yfleureau and 7 others β’ Jul 11, 2024 β’ 119
view article Article Welcome Gemma 2 - Google's new open LLM By philschmid and 5 others β’ Jun 27, 2024 β’ 129
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods By kashif and 4 others β’ Jan 18, 2024 β’ 55
view article Article Mixture of Experts Explained By osanseviero and 5 others β’ Dec 11, 2023 β’ 545
view article Article Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face By lewtun and 6 others β’ Dec 11, 2023 β’ 12
view article Article SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit By ronenlap and 5 others β’ Dec 6, 2023 β’ 9
view article Article Fine-tuning Llama 2 70B using PyTorch FSDP By smangrul and 3 others β’ Sep 13, 2023 β’ 22
view article Article Code Llama: Llama 2 learns to code By philschmid and 7 others β’ Aug 25, 2023 β’ 9