AI & ML interests

Robust knowledge distillation of the Whisper model via large-scale pseudo-labelling.

Recent Activity

distil-whisper's activity

Xenovaย 
in distil-whisper/distil-large-v2 about 1 month ago

Update to Transformers.js v3

#27 opened about 1 month ago by
Xenova
Xenovaย 
posted an update 2 months ago
view post
Post
12920
We did it. Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. โšก๏ธ

Generate 10 seconds of speech in ~1 second for $0.

What will you build? ๐Ÿ”ฅ
webml-community/kokoro-webgpu

The most difficult part was getting the model running in the first place, but the next steps are simple:
โœ‚๏ธ Implement sentence splitting, allowing for streamed responses
๐ŸŒ Multilingual support (only phonemization left)

Who wants to help?
ยท