openai/whisper-large-v3-turbo Automatic Speech Recognition • Updated Oct 4, 2024 • 4.72M • • 1.9k
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published 28 days ago • 43
Running on L4 156 156 CosyVoice2-0.5B 🥳 Generate realistic voice audio from text and audio prompts
Running on Zero 196 196 Seed Voice Conversion 🎤 Convert voice to match another using reference audio