Running on Zero 118 118 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System ๐ Generate audio from text using a reference audio sample
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition โข 6B โข Updated May 1 โข 418k โข 1.44k