4 23 31

Rasmus Aagaard

rasgaard

https://rasgaard.com

AI & ML interests

Industrial PhD student, research on model compression techniques and efficient on-device inference

Recent Activity

upvoted an article 13 days ago

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

updated a Space 23 days ago

rasgaard/trackio-testing

published a Space 23 days ago

rasgaard/trackio-testing

View all activity

Organizations

upvoted an article 13 days ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

15 days ago

•

upvoted 3 articles about 1 month ago

Article

Curating datasets directly on the Hub

Nov 27, 2025

•

Article

Continuous batching from first principles

Nov 25, 2025

•

289

Article

AI Model Optimization More Flexible Than Ever

Nov 17, 2025

•

upvoted an article about 2 months ago

Article

Running Large Transformer Models on Mobile and Edge Devices

Nov 3, 2025

•

upvoted 2 articles 2 months ago

Article

Streaming datasets: 100x More Efficient

Oct 27, 2025

•

Article

Sentence Transformers is joining Hugging Face!

Oct 22, 2025

•

upvoted 3 articles 3 months ago

Article

Get your VLM running in 3 simple steps on Intel CPUs

Oct 15, 2025

•

Article

SOTA OCR with Core ML and dots.ocr

Oct 2, 2025

•

Article

There is no such thing as a tokenizer-free lunch

Sep 25, 2025

•

upvoted a paper 4 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 134

upvoted an article 4 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

208

upvoted a paper 4 months ago

TorchAO: PyTorch-Native Training-to-Serving Model Optimization

Paper • 2507.16099 • Published Jul 21, 2025 • 7

upvoted an article 7 months ago

Article

AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan

Jun 2, 2025

•

upvoted a collection 7 months ago

POTION

Collection

These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 6 items • Updated Nov 13, 2025 • 14

upvoted 2 articles 8 months ago

Article

Blazingly fast whisper transcriptions with Inference Endpoints

May 13, 2025

•

Article

Vision Language Models (Better, faster, stronger)

May 12, 2025

•

579

upvoted a collection 9 months ago

Orpheus Multilingual Research Release

Collection

Beta Release of multilingual models. • 12 items • Updated Apr 10, 2025 • 108

upvoted an article 11 months ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

Feb 11, 2025

•

upvoted a collection about 1 year ago

Danish Text Datasets

Collection

These include high-quality Danish text datasets for pre-training, fine-tuning, etc. • 16 items • Updated Dec 15, 2024 • 3

Rasmus Aagaard

AI & ML interests

Recent Activity

Organizations

rasgaard's activity

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Curating datasets directly on the Hub

Continuous batching from first principles

AI Model Optimization More Flexible Than Ever

Running Large Transformer Models on Mobile and Edge Devices

Streaming datasets: 100x More Efficient

Sentence Transformers is joining Hugging Face!

Get your VLM running in 3 simple steps on Intel CPUs

SOTA OCR with Core ML and dots.ocr

There is no such thing as a tokenizer-free lunch

KV Caching Explained: Optimizing Transformer Inference Efficiency

AI Policy @🤗: Response to the 2025 National AI R&D Strategic Plan

Blazingly fast whisper transcriptions with Inference Endpoints

Vision Language Models (Better, faster, stronger)

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages