Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nvidia
/
canary-1b-v2

Automatic Speech Recognition
NeMo
PyTorch
automatic-speech-translation
speech
audio
Transformer
FastConformer
Conformer
NeMo
hf-asr-leaderboard
Eval Results
Model card Files Files and versions
xet
Community
14
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

there is diff in size of output feautres between EncDecMultiTaskModel and pretrained canary-1b-v2.nemo loaded

1
#14 opened 12 days ago by
mrunique007

You should open source this.

#13 opened 16 days ago by
CyborgPaloma

Error above ~1 minute ASR in hungarian sample.

1
#12 opened about 1 month ago by
robert1968

NIM API

3
#10 opened about 1 month ago by
Buttermilk03

Clarification on the tokenizer: Concatenated tokenizer or Aggregate tokenizer?

4
#9 opened about 2 months ago by
leestevennz

Is it possible to use "prompt" or "hotwords" to steer decoding similar to Whisper?

2
#8 opened about 2 months ago by
spashii

Webui available

🔥 🤗 3
#6 opened about 2 months ago by
methinkss

Local Installation Video and Testing - Step by Step

❤️ 1
1
#5 opened about 2 months ago by
fahdmirzac

Finetuning script for other languages

#4 opened about 2 months ago by
psk

How is Arabic language not added??

1
#3 opened about 2 months ago by
omar26

Turkish Language

#2 opened about 2 months ago by
Eurdem
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs