Gemma 3 Collection A collection of lightweight, state-of-the-art open models built from the same research and technology that powers the Gemini 2.0 models • 31 items • Updated 14 days ago • 19
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 14 days ago • 345
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention Paper • 2405.12981 • Published May 21, 2024 • 32
Core ML Text Generation Collection [WIP] On-device LLMs https://huggingface.co/blog/swift-coreml-llm • 3 items • Updated Sep 7, 2023 • 3