view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model By danielkorat and 7 others ā¢ Oct 29, 2024 ā¢ 51
view article Article Faster Assisted Generation with Dynamic Speculation By jmamou and 6 others ā¢ Oct 8, 2024 ā¢ 46
view article Article Google releases Gemma 2 2B, ShieldGemma and Gemma Scope By Xenova and 3 others ā¢ Jul 31, 2024 ā¢ 58
view article Article Code Llama: Llama 2 learns to code By philschmid and 7 others ā¢ Aug 25, 2023 ā¢ 9
view article Article Assisted Generation: a new direction toward low-latency text generation By joaogante ā¢ May 11, 2023 ā¢ 51