HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis Paper • 2010.05646 • Published Oct 12, 2020
Parallel Tacotron: Non-Autoregressive and Controllable TTS Paper • 2010.11439 • Published Oct 22, 2020
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis Paper • 2005.05957 • Published May 12, 2020
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation Paper • 2106.07889 • Published Jun 15, 2021
WaveGlow: A Flow-based Generative Network for Speech Synthesis Paper • 1811.00002 • Published Oct 31, 2018
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing Paper • 2110.07205 • Published Oct 14, 2021
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models Paper • 2403.03100 • Published Mar 5, 2024
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration Paper • 2210.01029 • Published Oct 3, 2022