Whisper Distillation

community

https://github.com/huggingface/distil-whisper

AI & ML interests

Robust knowledge distillation of the Whisper model via large-scale pseudo-labelling.

distil-whisper's activity

Xenova

authored a paper 7 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 8 days ago • 158

reach-vb

authored a paper 7 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 8 days ago • 158

reach-vb

in distil-whisper/distil-large-v3.5 21 days ago

Move Transformers.js-compatible version to separate repo

#5 opened 21 days ago by

Xenova

updated a model 21 days ago

distil-whisper/distil-large-v3.5-ONNX

Automatic Speech Recognition • Updated 21 days ago • 21 • 1

Xenova

in distil-whisper/distil-large-v3.5 21 days ago

Move Transformers.js-compatible version to separate repo

#5 opened 21 days ago by

Xenova

updated a collection 21 days ago

distil-large-v3.5

Collection

This collection contains the model repositories for distil-large-v3.5, which provides support for the most popular Whisper libraries. • 5 items • Updated 21 days ago • 7

Xenova

published a model 21 days ago

distil-whisper/distil-large-v3.5-ONNX

Automatic Speech Recognition • Updated 21 days ago • 21 • 1

Steveeeeeeen

in distil-whisper/distil-large-v3.5 22 days ago

Update README.md

#4 opened 22 days ago by

bofenghuang

Steveeeeeeen

published 3 models 22 days ago

Steveeeeeeen

updated a collection 22 days ago

distil-large-v3.5

Collection

This collection contains the model repositories for distil-large-v3.5, which provides support for the most popular Whisper libraries. • 5 items • Updated 21 days ago • 7

Xenova

in distil-whisper/distil-large-v3 about 1 month ago

Transformers.js - Enable external data format in Node.js

#13 opened about 1 month ago by

Xenova

Adding ONNX file of this model

#11 opened 6 months ago by

pavanteja007

Xenova

in distil-whisper/distil-large-v2 about 1 month ago

Update to Transformers.js v3

#27 opened about 1 month ago by

Xenova

posted an update 2 months ago

We did it. Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. ⚡️

Generate 10 seconds of speech in ~1 second for $0.

What will you build? 🔥
webml-community/kokoro-webgpu

The most difficult part was getting the model running in the first place, but the next steps are simple:
✂️ Implement sentence splitting, allowing for streamed responses
🌍 Multilingual support (only phonemization left)

Who wants to help?