GRMR V3 Models Collection An improved set of models for grammar correction. (Chat template should work, no "responding as an LLM" anymore, that kind of stuff). • 6 items • Updated 1 day ago • 5
One RL to See Them All: Visual Triple Unified Reinforcement Learning Paper • 2505.18129 • Published 13 days ago • 59
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 22 days ago • 110
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 4 items • Updated 7 days ago • 150
view article Article 17 Reasons Why Gradio Isn't Just Another UI Library By ysharma and 1 other • Apr 16 • 37
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory. • 19 items • Updated Apr 18 • 27
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 268
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 7 days ago • 195
view article Article Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning By burtenshaw • Apr 1 • 23
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 425
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Apr 15 • 68
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality By saurabhdash and 3 others • Mar 4 • 74
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google By ariG23498 and 2 others • Feb 19 • 70
GemmaX2 Collection GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated Feb 7 • 22