Smule Renaissance Small

A 10.4M-parameter generative audio model for restoring degraded vocals in any situation. It runs 10.5x faster than real time on an iPhone 12 CPU. It outperforms all open-source models in subjective quality and matches commercial models on singing voice restoration.
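To put the speed figure in concrete terms, here is a rough back-of-the-envelope sketch. It assumes "10.5x faster than real-time" means processing time equals audio duration divided by 10.5; the helper name `processing_seconds` is hypothetical and not part of this repository.

```python
def processing_seconds(audio_seconds: float, speedup: float = 10.5) -> float:
    """Estimate wall-clock processing time for a clip.

    Assumption: speedup is the ratio of audio duration to processing time,
    so a speedup of 10.5 means 10.5 seconds of audio per second of compute.
    """
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return audio_seconds / speedup


# A 3.5-minute (210 s) song would take about 20 s under this assumption.
estimate = processing_seconds(210.0)
print(f"Estimated processing time: {estimate:.1f} s")
```

Actual timings will depend on the device, thermal state, and audio format, so treat this only as an order-of-magnitude guide.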

Technical Report: Technical Report

Extreme Degradation Bench: Hugging Face Model


Getting Started

Setting up environment

# Create a virtual environment
uv venv cleanup --python=3.10
source cleanup/bin/activate
uv pip install -r requirements.txt

Running the model

python main.py {path-to-input} -o {path-to-output} -c {path-to-checkpoint}
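
To restore several files in one go, the command above can be wrapped in a small script. This is a minimal sketch, not part of the repository: it assumes `main.py` accepts exactly the positional input and the `-o` and `-c` options shown above, that inputs are `.wav` files, and that it is run from the repository root. The names `build_command` and `run_batch` are hypothetical.

```python
import subprocess
import sys
from pathlib import Path


def build_command(input_path, output_dir, checkpoint):
    """Build the argument list for one main.py invocation.

    The output file keeps the input's file name and is placed in output_dir.
    """
    output_path = Path(output_dir) / Path(input_path).name
    return [
        sys.executable, "main.py",
        str(input_path),
        "-o", str(output_path),
        "-c", str(checkpoint),
    ]


def run_batch(input_dir, output_dir, checkpoint, pattern="*.wav"):
    """Run main.py once per matching file in input_dir (assumed .wav)."""
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    processed = []
    for input_path in sorted(Path(input_dir).glob(pattern)):
        # check=True raises CalledProcessError if main.py exits non-zero.
        subprocess.run(build_command(input_path, output_dir, checkpoint), check=True)
        processed.append(input_path)
    return processed
```

For example, `run_batch("degraded", "restored", "{path-to-checkpoint}")` would process every `.wav` file in a `degraded` folder, with the checkpoint placeholder replaced by your own path.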