Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 4 items • Updated 1 day ago • 19
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System Paper • 2310.12378 • Published Oct 18, 2023
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer Paper • 2306.08753 • Published Jun 14, 2023 • 1
Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach Paper • 2309.05248 • Published Sep 11, 2023