Junyi Ao's picture

Junyi Ao

ajyy

·

https://ajyy.github.io/

ajyy

AI & ML interests

None yet

Organizations

None yet

upvoted a paper almost 2 years ago

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing

Paper • 2110.07205 • Published Oct 14, 2021 • 5

upvoted a collection almost 2 years ago

SpeechT5

The SpeechT5 framework consists of a shared seq2seq and six modal-specific (speech/text) pre/post-nets that can address a few audio-related tasks. • 8 items • Updated May 1, 2025 • 26