Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 23 days ago • 77
WavTokenizer-Medium-Large Collection https://arxiv.org/abs/2408.16532 • 4 items • Updated Feb 25 • 11
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Paper • 2408.16532 • Published Aug 29, 2024 • 51
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published Mar 6 • 70
PERSE: Personalized 3D Generative Avatars from A Single Portrait Paper • 2412.21206 • Published Dec 30, 2024 • 19