Create personalized speech using text and audio samples
Generate music from text descriptions
Transcribe audio and YouTube videos to text