Kunal Dhawan commited on
Commit
e2ff9b4
·
1 Parent(s): f3ccd7e

updated model description

Browse files

Signed-off-by: Kunal Dhawan <[email protected]>

Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -266,7 +266,8 @@ img {
266
  </style>
267
 
268
  ## Description:
269
- NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that achieves state-of-the art performance on multiple speech benchmarks. With 883 million parameters and running at more than 900 RTFx (on open-asr-leaderboard datasets), canary-1b-flash supports automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC). In addition to this, canary-1b-flash also supports functionality for word-level and segment-level timestamps for English, German, French, and Spanish. This model is released under the permissive CC-BY-4.0 license and is available for commercial use.
 
270
 
271
 
272
  ## Model Architecture:
@@ -576,7 +577,6 @@ F1-score on [Librispeech Test sets](https://www.openslr.org/12) at collar value
576
  |:-----------:|:---------:|:----------:|:----------:|
577
  | nemo-main | canary-1b-flash | 95.5 | 93.5 |
578
 
579
- Note that this is an experimental feature currently and not recommended for production use cases.
580
 
581
  ### Hallucination Robustness
582
  Number of characters per minute on [MUSAN](https://www.openslr.org/17) 48 hrs eval set
 
266
  </style>
267
 
268
  ## Description:
269
+ NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that achieve state-of-the-art performance on multiple speech benchmarks. With 883 million parameters and running at more than 900 RTFx (on open-asr-leaderboard datasets), canary-1b-flash supports automatic speech-to-text recognition (ASR) in four languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC). Additionally, Canary-1B-Flash offers an experimental feature for word-level and segment-level timestamps in English, German, French, and Spanish.
270
+ This model is released under the permissive CC-BY-4.0 license and is available for commercial use.
271
 
272
 
273
  ## Model Architecture:
 
577
  |:-----------:|:---------:|:----------:|:----------:|
578
  | nemo-main | canary-1b-flash | 95.5 | 93.5 |
579
 
 
580
 
581
  ### Hallucination Robustness
582
  Number of characters per minute on [MUSAN](https://www.openslr.org/17) 48 hrs eval set