Commit
·
f2d877b
1
Parent(s):
7620057
Switched position of tables
Browse files
README.md
CHANGED
@@ -183,6 +183,27 @@ The model was evaluated using the following metrics:
|
|
183 |
|
184 |
<img src="https://huggingface.co/CoRal-dataset/roest-wav2vec2-315m-v2/resolve/main/images/cer.png">
|
185 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
186 |
### Table CER scores in % of evaluation across demographics on the CoRal test data
|
187 |
| Category | roest-whisper-large-v1 | roest-wav2vec2-315m-v1 | roest-wav2vec2-315m-v2 |
|
188 |
|:---:|:---:|:---:|:---:|
|
@@ -203,25 +224,6 @@ The model was evaluated using the following metrics:
|
|
203 |
| Østjysk | 2.6 | 4.0 | 4.1 |
|
204 |
| Overall | 4.3 | 6.6 | 6.5 |
|
205 |
|
206 |
-
### Table WER scores in % of evaluation across demographics on the CoRal test data
|
207 |
-
| Category | roest-whisper-large-v1 | roest-wav2vec2-315m-v1 | roest-wav2vec2-315m-v2 |
|
208 |
-
|:---:|:---:|:---:|:---:|
|
209 |
-
| female | 11.5 | 18.5 | 17.7 |
|
210 |
-
| male | 9.4 | 15.5 | 14.9 |
|
211 |
-
| 0-25 | 9.0 | 14.7 | 14.0 |
|
212 |
-
| 25-50 | 10.1 | 16.6 | 15.8 |
|
213 |
-
| 50+ | 11.3 | 18.2 | 17.7 |
|
214 |
-
| Bornholmsk | 9.8 | 17.7 | 15.7 |
|
215 |
-
| Fynsk | 12.1 | 18.3 | 17.7 |
|
216 |
-
| Københavnsk | 5.9 | 10.2 | 10.0 |
|
217 |
-
| Non-native | 12.2 | 20.9 | 19.4 |
|
218 |
-
| Nordjysk | 4.5 | 7.7 | 7.5 |
|
219 |
-
| Sjællandsk | 7.6 | 12.6 | 12.7 |
|
220 |
-
| Sydømål | 10.0 | 14.9 | 15.3 |
|
221 |
-
| Sønderjysk | 17.5 | 26.0 | 25.4 |
|
222 |
-
| Vestjysk | 15.0 | 26.3 | 25.2 |
|
223 |
-
| Østjysk | 7.5 | 11.7 | 11.3 |
|
224 |
-
| Overall | 10.4 | 17.0 | 16.3 |
|
225 |
|
226 |
### Roest-wav2vec2-315M with and without language model
|
227 |
The inclusion of a post-processing language model can affect the performance significantly. The Roest-v1 and Roest-v2 models are using the same Language Model (LM). The utilized LM is the one trained and used by [alexandrainst/roest-wav2vec2-315m-v1](https://huggingface.co/alexandrainst/roest-315m).
|
|
|
183 |
|
184 |
<img src="https://huggingface.co/CoRal-dataset/roest-wav2vec2-315m-v2/resolve/main/images/cer.png">
|
185 |
|
186 |
+
### Table WER scores in % of evaluation across demographics on the CoRal test data
|
187 |
+
| Category | roest-whisper-large-v1 | roest-wav2vec2-315m-v1 | roest-wav2vec2-315m-v2 |
|
188 |
+
|:---:|:---:|:---:|:---:|
|
189 |
+
| female | 11.5 | 18.5 | 17.7 |
|
190 |
+
| male | 9.4 | 15.5 | 14.9 |
|
191 |
+
| 0-25 | 9.0 | 14.7 | 14.0 |
|
192 |
+
| 25-50 | 10.1 | 16.6 | 15.8 |
|
193 |
+
| 50+ | 11.3 | 18.2 | 17.7 |
|
194 |
+
| Bornholmsk | 9.8 | 17.7 | 15.7 |
|
195 |
+
| Fynsk | 12.1 | 18.3 | 17.7 |
|
196 |
+
| Københavnsk | 5.9 | 10.2 | 10.0 |
|
197 |
+
| Non-native | 12.2 | 20.9 | 19.4 |
|
198 |
+
| Nordjysk | 4.5 | 7.7 | 7.5 |
|
199 |
+
| Sjællandsk | 7.6 | 12.6 | 12.7 |
|
200 |
+
| Sydømål | 10.0 | 14.9 | 15.3 |
|
201 |
+
| Sønderjysk | 17.5 | 26.0 | 25.4 |
|
202 |
+
| Vestjysk | 15.0 | 26.3 | 25.2 |
|
203 |
+
| Østjysk | 7.5 | 11.7 | 11.3 |
|
204 |
+
| Overall | 10.4 | 17.0 | 16.3 |
|
205 |
+
|
206 |
+
|
207 |
### Table CER scores in % of evaluation across demographics on the CoRal test data
|
208 |
| Category | roest-whisper-large-v1 | roest-wav2vec2-315m-v1 | roest-wav2vec2-315m-v2 |
|
209 |
|:---:|:---:|:---:|:---:|
|
|
|
224 |
| Østjysk | 2.6 | 4.0 | 4.1 |
|
225 |
| Overall | 4.3 | 6.6 | 6.5 |
|
226 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
227 |
|
228 |
### Roest-wav2vec2-315M with and without language model
|
229 |
The inclusion of a post-processing language model can affect the performance significantly. The Roest-v1 and Roest-v2 models are using the same Language Model (LM). The utilized LM is the one trained and used by [alexandrainst/roest-wav2vec2-315m-v1](https://huggingface.co/alexandrainst/roest-315m).
|