A Unified Framework for Image Customization
High-fidelity Virtual Try-on
Generate music from text descriptions
Generate images from text descriptions
Transcribe audio from microphone, files, or YouTube