MSRChallenge (Music Source Restoration Challenge)

Higobeatz

authored 3 papers 3 months ago

Noise-robust Speech Separation with Fast Generative Correction

Paper • 2406.07461 • Published Jun 11, 2024

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Paper • 2505.19314 • Published May 25 • 4

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published Jun 3 • 8

westbrook

authored 4 papers 3 months ago

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published Jun 3 • 8

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Paper • 2505.19314 • Published May 25 • 4

Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

Paper • 2505.14648 • Published May 20 • 8

Noise-robust Speech Separation with Fast Generative Correction

Paper • 2406.07461 • Published Jun 11, 2024

yongyizang

authored a paper 6 months ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 69

Higobeatz

authored 3 papers 12 months ago

DreamVoice: Text-Guided Voice Conversion

Paper • 2406.16314 • Published Jun 24, 2024 • 1

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis

Paper • 2409.07556 • Published Sep 11, 2024 • 2

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer

Paper • 2409.08425 • Published Sep 12, 2024 • 10

westbrook

authored 5 papers 12 months ago

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer

Paper • 2409.08425 • Published Sep 12, 2024 • 10

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published Sep 17, 2024 • 20

Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset

Paper • 2306.03030 • Published Jun 5, 2023 • 1

DreamVoice: Text-Guided Voice Conversion

Paper • 2406.16314 • Published Jun 24, 2024 • 1

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis

Paper • 2409.07556 • Published Sep 11, 2024 • 2

Higobeatz

authored a paper 12 months ago

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published Sep 17, 2024 • 20

qiuqiangkong

authored 2 papers about 2 years ago

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37

WavJourney: Compositional Audio Creation with Large Language Models

Paper • 2307.14335 • Published Jul 26, 2023 • 44

Team members 6