nithinraok commited on
Commit
a0b7f5b
Β·
1 Parent(s): 652da3a

Signed-off-by: nithinraok <[email protected]>

Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -265,6 +265,9 @@ img {
265
  }
266
  </style>
267
 
 
 
 
268
  ## Description:
269
  NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that achieve state-of-the-art performance on multiple speech benchmarks. With 883 million parameters and an inference speed of more than 1000 RTFx (on open-asr-leaderboard datasets), canary-1b-flash supports automatic speech-to-text recognition (ASR) in four languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC). Additionally, canary-1b-flash offers an experimental feature for word-level and segment-level timestamps in English, German, French, and Spanish.
270
  This model is released under the permissive CC-BY-4.0 license and is available for commercial use.
 
265
  }
266
  </style>
267
 
268
+ > **πŸŽ‰ NEW: Canary 1B V2 is now available!**
269
+ > 🌍 **25 European Languages** | ⏱️ **Much Improved Timestamp Prediction** | πŸ”„ **Enhanced ASR & AST** | πŸ”— **[Try it here: nvidia/canary-1b-v2](https://huggingface.co/nvidia/canary-1b-v2)**
270
+
271
  ## Description:
272
  NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that achieve state-of-the-art performance on multiple speech benchmarks. With 883 million parameters and an inference speed of more than 1000 RTFx (on open-asr-leaderboard datasets), canary-1b-flash supports automatic speech-to-text recognition (ASR) in four languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC). Additionally, canary-1b-flash offers an experimental feature for word-level and segment-level timestamps in English, German, French, and Spanish.
273
  This model is released under the permissive CC-BY-4.0 license and is available for commercial use.