YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation Paper β’ 2407.04822 β’ Published Jul 5, 2024 β’ 4
Running 256 256 Qwen2.5 Omni 7B Demo π Generate text and speech responses from text, images, or audio input