Omar Sanseviero's picture

Omar Sanseviero

osanseviero

·

https://osanseviero.github.io/hackerllama/

AI & ML interests

Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙

Recent Activity

upvoted a changelog about 14 hours ago

New Inference Providers Dashboard

upvoted a collection 1 day ago

upvoted a paper 4 days ago

One RL to See Them All: Visual Triple Unified Reinforcement Learning

View all activity

Organizations

osanseviero's activity

upvoted a changelog about 14 hours ago

Changelog

New Inference Providers Dashboard

about 18 hours ago

• 23

upvoted a collection 1 day ago

GRMR V3 Models

An improved set of models for grammar correction. (Chat template should work, no "responding as an LLM" anymore, that kind of stuff). • 6 items • Updated 1 day ago • 5

upvoted a paper 4 days ago

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published 13 days ago • 59

upvoted an article 5 days ago

Article

The Transformers Library: standardizing model definitions

By

and 3 others •

22 days ago

• 110

upvoted 2 collections 16 days ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 4 items • Updated 7 days ago • 150

Gemma 3n Preview

2 items • Updated 7 days ago • 109

upvoted an article about 1 month ago

Article

17 Reasons Why Gradio Isn't Just Another UI Library

By

and 1 other •

Apr 16

• 37

upvoted a collection about 2 months ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory. • 19 items • Updated Apr 18 • 27

upvoted a paper about 2 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 268

upvoted an article about 2 months ago

Article

The Large Language Model Course

By

•

Jan 16

• 184

upvoted a collection 2 months ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 7 days ago • 195

upvoted an article 2 months ago

Article

Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning

By

•

Apr 1

• 23

upvoted a paper 2 months ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 51

upvoted a collection 3 months ago

Gemma 3 Release

24 items • Updated 7 days ago • 379

upvoted an article 3 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By

and 3 others •

Mar 12

• 425

upvoted a collection 3 months ago

Cohere Labs Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Apr 15 • 68

upvoted an article 3 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

By

and 3 others •

Mar 4

• 74

upvoted a paper 3 months ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18 • 58

upvoted an article 3 months ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

By

and 2 others •

Feb 19

• 70

upvoted a collection 3 months ago

GemmaX2

GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated Feb 7 • 22