microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated May 1 • 480k • 1.41k
Running 553 553 Talking Face Generation with Multilingual TTS 👄 Generate a talking face video from text