Sleeping 395 395 HierSpeech++ (Zero-shot TTS) β‘ Generate high-quality speech from text using a prompt audio
pyannote/speaker-diarization-3.1 Automatic Speech Recognition β’ Updated May 10, 2024 β’ 14.4M β’ 908
stabilityai/stable-video-diffusion-img2vid-xt Image-to-Video β’ Updated Jul 10, 2024 β’ 133k β’ 3.07k
pyannote/speaker-diarization Automatic Speech Recognition β’ Updated May 10, 2024 β’ 1.19M β’ 1.08k