Open ASR Leaderboard 🏆: Request and view assessments for speech recognition models
MoshiVis v0.1 Collection: MoshiVis is a Vision Speech Model built as a perceptually augmented version of Moshi v0.1 for conversing about image inputs (8 items)