Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published 17 days ago • 73
Gemma 3 Collection A collection of lightweight, state-of-the-art open models built from the same research and technology that powers the Gemini 2.0 models • 32 items • Updated 23 days ago • 27
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 425
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention Paper • 2405.12981 • Published May 21, 2024 • 34
Core ML Text Generation Collection [WIP] On-device LLMs https://huggingface.co/blog/swift-coreml-llm • 3 items • Updated Sep 7, 2023 • 4
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • Nov 19, 2024 • 110
view article Article WWDC 24: Running Mistral 7B with Core ML By FL33TW00D-HF and 3 others • Jul 22, 2024 • 61