AlexanderMaz
/

LanguageModel_Fusion

Automatic Speech Recognition

Model card Files Files and versions

AlexanderMaz commited on Jan 6, 2024

Commit

b717995

·

1 Parent(s): 36c6e03

Update README

Add data description.

Files changed (1) hide show

README.md +24 -0

README.md CHANGED Viewed

@@ -1,3 +1,27 @@
 ---
 license: apache-2.0
 ---

 ---
 license: apache-2.0
+datasets:
+- librispeech_asr
+language:
+- en
+metrics:
+- wer
+pipeline_tag: automatic-speech-recognition
+tags:
+- asr
+- rescoring
+- rnn-t
+- gpt2
+- nemo
+- lstm
+- kenlm
 ---
+The data is used in project https://github.com/Alexander92-cpu/LanguageModel_Fusion
+Data desciption:
+'asr/stt_en_conformer_transducer_small.nemo' - NeMo ASR pre-trained RNN-T model;
+'gpt2' - fine-tuned GPT-2 LM model for rescoring;
+'kenlm/4_ngram_output.bin' - 4-gram language model;
+'lstm' - trained from scratch word-level LSTM LM model and the corresponding tokenizer;
+'text' - contains text data used for training, validation, and testing.