Convert text to speech with emotional voices
Create videos with FFMPEG + Qwen2.5-Coder
Ask questions about images
Generate MIDI music from prompts
Create images of a given character in different poses
Generate realistic voice synthesis using text and reference audio