YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published Mar 11 • 66
Scaling Transformers for Low-Bitrate High-Quality Speech Coding Paper • 2411.19842 • Published Nov 29, 2024 • 12
Efficient Audio Captioning with Encoder-Level Knowledge Distillation Paper • 2407.14329 • Published Jul 19, 2024 • 5
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound Paper • 2405.00233 • Published Apr 30, 2024 • 18
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Paper • 2404.12387 • Published Apr 18, 2024 • 40
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Paper • 2308.05734 • Published Aug 10, 2023 • 37
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Paper • 2308.05734 • Published Aug 10, 2023 • 37
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Paper • 2308.05734 • Published Aug 10, 2023 • 37
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies Paper • 2308.01546 • Published Aug 3, 2023 • 18
WavJourney: Compositional Audio Creation with Large Language Models Paper • 2307.14335 • Published Jul 26, 2023 • 44
WavJourney: Compositional Audio Creation with Large Language Models Paper • 2307.14335 • Published Jul 26, 2023 • 44
WavJourney: Compositional Audio Creation with Large Language Models Paper • 2307.14335 • Published Jul 26, 2023 • 44