11 3

Shinji Watanabe

sw005320

sw005320

AI & ML interests

None yet

Recent Activity

new activity about 1 month ago

espnet/owsm_v4_medium_1B:Add library_name and ensure correct pipeline tag

new activity 5 months ago

espnet/yodas:YODAS-clean dataset release?

new activity 6 months ago

espnet/Hoon_Chung_zeroth_korean_asr_train_asr_transformer5_raw_bpe_valid.acc.ave:Update README.md

View all activity

Organizations

New activity in espnet/owsm_v4_medium_1B about 1 month ago

Add library_name and ensure correct pipeline tag

#2 opened about 1 month ago by

nielsr

New activity in espnet/yodas 5 months ago

YODAS-clean dataset release?

#13 opened 5 months ago by

seastar105

New activity in espnet/Hoon_Chung_zeroth_korean_asr_train_asr_transformer5_raw_bpe_valid.acc.ave 6 months ago

Update README.md

#1 opened 6 months ago by

castleOne

authored a paper about 1 year ago

Towards Robust Speech Representation Learning for Thousands of Languages

Paper • 2407.00837 • Published Jun 30, 2024 • 11

New activity in espnet/owsm_v3.1_ebf over 1 year ago

TypeError when attempting to use the model

#1 opened over 1 year ago by

cifkao

New activity in espnet/yodas over 1 year ago

Speaker id

#6 opened over 1 year ago by

gargakshat99

liked a dataset over 1 year ago

espnet/yodas

Updated Jun 10, 2024 • 42.8k • 115

authored a paper over 1 year ago

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Paper • 2401.16658 • Published Jan 30, 2024 • 14

updated a Space over 1 year ago

ESPnet2 TTS

📈

Generate speech from text in multiple languages

New activity in ifire/ESPnet2-TTS over 1 year ago

Update requirements.txt

#1 opened over 1 year ago by

sw005320

liked a Space over 1 year ago

OWSM Demo

🔊

authored a paper over 1 year ago

Music ControlNet: Multiple Time-varying Controls for Music Generation

Paper • 2311.07069 • Published Nov 13, 2023 • 45

New activity in espnet/shihlun_asr_whisper_medium_finetuned_librispeech100 over 1 year ago

Needed guidance on fine-tuning whisper on custom dataset.

#1 opened over 1 year ago by

Naram

New activity in espnet/kan-bayashi_ljspeech_vits almost 2 years ago

how to run

#10 opened almost 2 years ago by

0xrk

updated a model over 2 years ago

espnet/kan-bayashi_csmsc_tts_train_tacotron2_raw_phn_pypinyin_g2p_phone_train.loss.best

Text-to-Speech • Updated Dec 5, 2022 • 12 • 3

New activity in espnet/kan-bayashi_csmsc_tts_train_tacotron2_raw_phn_pypinyin_g2p_phone_train.loss.best over 2 years ago

Update README.md

#1 opened over 2 years ago by

tiansz

updated 2 models over 3 years ago

sw005320/Shinji_Watanabe_laborotv_asr_train_blstm

Automatic Speech Recognition • Updated Feb 23, 2022 • 6

sw005320/aidatatang_200zh_conformer

Automatic Speech Recognition • Updated Dec 28, 2021 • 7 • 3

liked a Space over 3 years ago

ESPnet2 SLU

📈

Shinji Watanabe

AI & ML interests

Recent Activity

Organizations

sw005320's activity

Add library_name and ensure correct pipeline tag

YODAS-clean dataset release?

Update README.md

TypeError when attempting to use the model

Speaker id

ESPnet2 TTS

Update requirements.txt

OWSM Demo

Needed guidance on fine-tuning whisper on custom dataset.

how to run

Update README.md

ESPnet2 SLU