timestamp description
README.md
CHANGED
@@ -266,7 +266,9 @@ img {
 </style>
 
 ## Description:
-NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that achieves state-of-the-art performance on multiple speech benchmarks. With 182 million parameters and running at more than 1300 RTFx (on open-asr-leaderboard sets), canary-180m-flash supports automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC).
+NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that achieves state-of-the-art performance on multiple speech benchmarks. With 182 million parameters and running at more than 1300 RTFx (on open-asr-leaderboard sets), canary-180m-flash supports automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC).
+Additionally, canary-180m-flash offers an experimental feature for word-level and segment-level timestamps in English, German, French, and Spanish.
+This model is released under the permissive CC-BY-4.0 license and is available for commercial use.
 
 
 ## Model Architecture: