Commit
·
ecbc11f
1
Parent(s):
e10defc
Update README.md
Browse files
README.md
CHANGED
|
@@ -75,4 +75,15 @@ The following hyperparameters were used during training:
|
|
| 75 |
Note : Training and evaluation script modified from https://huggingface.co/chmanoj/xls-r-300m-te and https://github.com/huggingface/transformers/tree/master/examples/research_projects/robust-speech-event.
|
| 76 |
Bengali speech data was not available from common voice or librispeech multilingual datasets, so OpenSLR53 has been used.
|
| 77 |
|
| 78 |
-
Note 2 : Minimum audio duration of 0.1s has been used to filter the training data which excluded may be 10-20 samples.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 75 |
Note : Training and evaluation script modified from https://huggingface.co/chmanoj/xls-r-300m-te and https://github.com/huggingface/transformers/tree/master/examples/research_projects/robust-speech-event.
|
| 76 |
Bengali speech data was not available from common voice or librispeech multilingual datasets, so OpenSLR53 has been used.
|
| 77 |
|
| 78 |
+
Note 2 : Minimum audio duration of 0.1s has been used to filter the training data which excluded may be 10-20 samples.
|
| 79 |
+
|
| 80 |
+
# Citation
|
| 81 |
+
|
| 82 |
+
@misc {tahsin_mayeesha_2023,
|
| 83 |
+
author = { {Tahsin Mayeesha} },
|
| 84 |
+
title = { wav2vec2-bn-300m (Revision e10defc) },
|
| 85 |
+
year = 2023,
|
| 86 |
+
url = { https://huggingface.co/Tahsin-Mayeesha/wav2vec2-bn-300m },
|
| 87 |
+
doi = { 10.57967/hf/0939 },
|
| 88 |
+
publisher = { Hugging Face }
|
| 89 |
+
}
|