Aymeric Roucher

m-ric

AI & ML interests

MLE at Hugging Face 🤗. LLMs, Agents, RAG, Multimodal.

Posts

🧠 This Stanford paper might hold the key to OpenAI o1’s performance: what makes Chain of Thought so effective? ⇒ It unlocks inherently sequential tasks that transformers otherwise can’t solve!

💭 Reminder: Chain of Thought (CoT) means instructing the model to “think step by step”. Often it’s literally just adding “let’s think step by step” to the prompt.
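
For illustration, here is a minimal sketch of zero-shot CoT prompting with huggingface_hub's InferenceClient; the model name and question are placeholder choices of mine, not taken from the paper:

```python
from huggingface_hub import InferenceClient

# Any instruct model served on the Hub works here; this one is just an example.
client = InferenceClient("meta-llama/Meta-Llama-3-8B-Instruct")

question = "If I square 3, then square the result, then square it again, what do I get?"

# Zero-shot CoT: simply append the magic phrase to elicit step-by-step reasoning.
response = client.chat_completion(
    messages=[{"role": "user", "content": question + "\nLet's think step by step."}],
    max_tokens=512,
)
print(response.choices[0].message.content)
```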

🤔 This method has been shown to be unreasonably effective at increasing performance on benchmarks. However, why it works so well remains unclear.

Here's the scoop: Transformers are amazing at parallel processing, but they've always struggled with tasks that require sequential reasoning.

⛔️ For instance, if you ask them the result of 3^2^2^2^… with 20 iterations, they’ll nearly always fail.
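
To see why this task is inherently serial: each squaring needs the previous result, so the 20 steps cannot be spread across parallel computation. A toy illustration (the modulus is an arbitrary choice of mine, just to keep the numbers readable):

```python
# Iterated squaring: x -> x^2, repeated. Each step depends on the previous one,
# so 20 squarings require 20 sequential operations with no parallel shortcut.
p = 1_000_003   # arbitrary prime modulus to keep intermediate values small
x = 3
for step in range(20):
    x = (x * x) % p          # step i cannot start before step i-1 is done
print(x)                     # = 3^(2^20) mod p
```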

💡 Indeed, the researchers prove mathematically, by modeling transformer networks as logical circuits, that a transformer answering directly cannot solve sequential tasks whose required circuit depth exceeds a fixed threshold.

But CoT enables sequential reasoning:

- 🧱 Each step in the CoT corresponds to simulating one operation in a complex circuit (see the sketch after this list).
- 🔄 This allows the transformer to "reset" the depth of intermediate outputs, overcoming previous limitations.
- 🚀 Thus, with CoT, constant-depth transformers can now solve ANY problem computable by polynomial-size circuits! (That's a huge class of problems in computer science.)
- 🔑 Transformers can now handle tricky tasks like iterated squaring (computing 3^2^2^2^2), composing permutations, and evaluating circuits - stuff that requires serial computation.
- 📊 The improvement is especially dramatic for transformers with a limited depth. Empirical tests on four arithmetic problems showed massive accuracy gains with CoT on inherently serial tasks.
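
Here is a hypothetical illustration of what “one operation per CoT step” means for the iterated-squaring task: each intermediate value gets written into the output text, so producing the next step only requires a single squaring over what is already on the page, rather than the whole 20-deep computation inside one forward pass:

```python
# Hypothetical CoT "scratchpad" for the iterated-squaring task: each emitted line
# holds one intermediate result, so the model's working state lives in the text
# itself and each new step is a single, shallow operation.
p, x = 1_000_003, 3
lines = []
for step in range(1, 21):
    x = (x * x) % p
    lines.append(f"Step {step}: square the previous value -> {x}")
print("\n".join(lines))
print(f"Answer: {x}")
```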

Main takeaway: Chain-of-thought isn't just a neat trick - it fundamentally expands what transformer models can do!

Read the paper 👉 Chain of Thought Empowers Transformers to Solve Inherently Serial Problems (2402.12875)

Anthropic just released a chunk-contextualization technique that vastly improves RAG performance! 🔥

Crash reminder: Retrieval Augmented Generation (RAG) is a widely-used technique for improving your LLM chatbot's answers to user questions.

It goes like this: instead of generating an answer straight away, the LLM call is preceded by a Retrieval step that finds relevant documents in your knowledge base through semantic search, then appends the top K documents to the prompt. ➡️ As a result, the LLM answer is grounded in context.
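
As a rough sketch of that retrieval step (sentence-transformers for the embeddings; the model, toy data, and prompt template are illustrative choices of mine, not a reference implementation):

```python
import numpy as np
from sentence_transformers import SentenceTransformer

# Toy knowledge base, already split into chunks.
chunks = [
    "ACME's Q2 2023 revenue grew 3% over the previous quarter.",
    "The company was founded in 1999 and is headquartered in Berlin.",
    "ACME's main product line is industrial sensors.",
]

embedder = SentenceTransformer("all-MiniLM-L6-v2")   # any embedding model works
chunk_embeddings = embedder.encode(chunks, normalize_embeddings=True)

def retrieve(question: str, k: int = 2) -> list[str]:
    """Semantic search: return the top-k chunks most similar to the question."""
    q_emb = embedder.encode([question], normalize_embeddings=True)
    scores = chunk_embeddings @ q_emb[0]   # dot product = cosine sim (vectors are normalized)
    return [chunks[i] for i in np.argsort(scores)[::-1][:k]]

question = "How did ACME's revenue evolve in Q2 2023?"
context = "\n".join(retrieve(question))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
# `prompt` is then sent to the LLM of your choice.
```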

⛔️ The difficulty with this retrieval step is that when you split your documents into chunks to be retrieved, you lose context. So important chunks could be missed.

💡 Anthropic's just-released blog post shows that you can add some context to each chunk with one LLM call: you then embed the original chunk plus that bit of added context, so the embedding is much more representative of the chunk within its document!
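
A minimal sketch of that contextualization pass with the anthropic Python SDK; the model choice, prompt wording, and caching setup are my assumptions, not copied from their post (depending on your SDK version, prompt caching may require a beta header):

```python
import anthropic

client = anthropic.Anthropic()   # reads ANTHROPIC_API_KEY from the environment

def contextualize_chunk(document: str, chunk: str) -> str:
    """One LLM call that situates `chunk` within the whole `document`."""
    response = client.messages.create(
        model="claude-3-haiku-20240307",   # assumed model choice
        max_tokens=150,
        system=[
            {
                "type": "text",
                "text": f"<document>\n{document}\n</document>",
                # Prompt caching: the long, identical document prefix is cached,
                # so repeating it for every chunk of the document costs far less.
                "cache_control": {"type": "ephemeral"},
            }
        ],
        messages=[
            {
                "role": "user",
                "content": (
                    f"Here is a chunk from the document above:\n<chunk>\n{chunk}\n</chunk>\n"
                    "Write 1-2 sentences situating this chunk within the document, "
                    "to improve search retrieval. Answer with the context only."
                ),
            }
        ],
    )
    context = response.content[0].text
    return f"{context}\n{chunk}"   # this contextualized text is what gets embedded

# Usage (with `chunks` and `embedder` as in the earlier retrieval sketch):
# contextualized = [contextualize_chunk(full_doc_text, c) for c in chunks]
# chunk_embeddings = embedder.encode(contextualized, normalize_embeddings=True)
```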

🤔 Isn't that crazy expensive? It would have been before, but not so much anymore with their new prompt caching feature, which makes sending thousands of requests that share the same long prompt much less expensive. They give an indicative price tag of only $1.02 per million document tokens processed!

✅ And this vastly improves performance on their benchmark!

Read their blog post 👉 https://www.anthropic.com/news/contextual-retrieval