Generate audio from text using a reference voice
Generate Vietnamese speech from text and a reference audio sample