Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
kotoba-speech
's Collections
[Spaces] Internal Demo
[Model] Internal Models for Preprocessing
[Model] Alignment-aware TTS models
[Result] ASR
[Dataset] Chunk-TTS training datasets
[Dataset] TTS training datasets
[Dataset] ASR training datasets
[Dataset] SimulT Mimi Tokenized (2 min.)
[Dataset] Mimi Tokenize + Denoise (2 min.)
[Dataset] Mimi Tokenize (2 min.)
[Dataset] Transcribe (2 min.)
[Dataset] Transcribe (full)
[Dataset] Crawl Raw Audio
[Dataset] Eval ASR
[legacy] TTS Audio (denoise + aug.)
[legacy] Simul-Translate Mimi Tokens (30 sec.)
[legacy] Simul-Translate Audio (30 sec.)
[legacy] Transcribe (30 sec.)
[legacy] Transcribe (5 min.)
[Dataset] Eval ASR
updated
11 days ago
Upvote
-
japanese-asr/ja_asr.jsut_basic5000
Viewer
•
Updated
Apr 14, 2024
•
5k
•
155
•
5
japanese-asr/ja_asr.common_voice_8_0
Viewer
•
Updated
Apr 14, 2024
•
4.48k
•
45
•
2
Upvote
-
Share collection
View history
Collection guide
Browse collections