Towards Robust Speech Representation Learning for Thousands of Languages Paper β’ 2407.00837 β’ Published Jun 30, 2024 β’ 11
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer Paper β’ 2401.16658 β’ Published Jan 30, 2024 β’ 14
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper β’ 2311.07069 β’ Published Nov 13, 2023 β’ 45
espnet/kan-bayashi_csmsc_tts_train_tacotron2_raw_phn_pypinyin_g2p_phone_train.loss.best Text-to-Speech β’ Updated Dec 5, 2022 β’ 4 β’ 3
sw005320/Shinji_Watanabe_laborotv_asr_train_blstm Automatic Speech Recognition β’ Updated Feb 23, 2022 β’ 2
sw005320/aidatatang_200zh_conformer Automatic Speech Recognition β’ Updated Dec 28, 2021 β’ 2 β’ 3