Generate speech from Cantonese text using a chosen voice
Separate vocals from background in audio
FoundHand