Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis Paper • 2502.04128 • Published Feb 6 • 26
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model Paper • 2305.06908 • Published May 11, 2023 • 6
CoMoSVC: Consistency Model-based Singing Voice Conversion Paper • 2401.01792 • Published Jan 3, 2024 • 11
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model Paper • 2408.17175 • Published Aug 30, 2024 • 4