Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
kotoba-speech
's Collections
[Spaces] Internal Demo
[Model] Internal Models for Preprocessing
[Model] Alignment-aware TTS models
[Result] ASR
[Dataset] Chunk-TTS training datasets
[Dataset] TTS training datasets
[Dataset] ASR training datasets
[Dataset] SimulT Mimi Tokenized (2 min.)
[Dataset] Mimi Tokenize + Denoise (2 min.)
[Dataset] Mimi Tokenize (2 min.)
[Dataset] Transcribe (2 min.)
[Dataset] Transcribe (full)
[Dataset] Crawl Raw Audio
[Dataset] Eval ASR
[legacy] TTS Audio (denoise + aug.)
[legacy] Simul-Translate Mimi Tokens (30 sec.)
[legacy] Simul-Translate Audio (30 sec.)
[legacy] Transcribe (30 sec.)
[legacy] Transcribe (5 min.)
[Result] ASR
updated
5 days ago
Upvote
-
This collection has no items.
Upvote
-
Share collection
View history
Collection guide
Browse collections