Adding information from new training.
Browse files
README.md
CHANGED
@@ -26,6 +26,8 @@ Trained on +-130hrs
|
|
26 |
and upgraded to new dataset +-200hrs 30k samples in 2 days with mostly 20s on A100
|
27 |
Added 3 days on 3500 speakers from, 60k samples audios joined with from 20 to 25s of
|
28 |
https://commonvoice.mozilla.org/pt
|
|
|
|
|
29 |
|
30 |
Use lower case, and for numbers use num2words. Sample bellow.
|
31 |
|
@@ -90,13 +92,31 @@ time data : 183:27:23
|
|
90 |
min sec : 1.02
|
91 |
max sec : 30.0
|
92 |
vocab : 2545
|
|
|
93 |
|
94 |
-
|
95 |
samples : 128908
|
96 |
time data : 196:24:47
|
97 |
min sec : 1.0
|
98 |
max sec : 25.0
|
99 |
vocab : 2545
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
100 |
|
101 |
|
102 |
License
|
|
|
26 |
and upgraded to new dataset +-200hrs 30k samples in 2 days with mostly 20s on A100
|
27 |
Added 3 days on 3500 speakers from, 60k samples audios joined with from 20 to 25s of
|
28 |
https://commonvoice.mozilla.org/pt
|
29 |
+
Added 2 days on 90k samples audios, colab A100.
|
30 |
+
Added 5 days on 90k samples audios, with 2 rtx3090 48gb parallel.
|
31 |
|
32 |
Use lower case, and for numbers use num2words. Sample bellow.
|
33 |
|
|
|
92 |
min sec : 1.02
|
93 |
max sec : 30.0
|
94 |
vocab : 2545
|
95 |
+
mostly 5s
|
96 |
|
97 |
+
around 4 days ( 800k steps )
|
98 |
samples : 128908
|
99 |
time data : 196:24:47
|
100 |
min sec : 1.0
|
101 |
max sec : 25.0
|
102 |
vocab : 2545
|
103 |
+
mostly 10s
|
104 |
+
|
105 |
+
around 7 days (1000k steps)
|
106 |
+
samples : 90947
|
107 |
+
time data : 447:51:31
|
108 |
+
min sec : 3.0
|
109 |
+
max sec : 30.0
|
110 |
+
vocab : 2545
|
111 |
+
|
112 |
+
Total audios: 90,947
|
113 |
+
Minimum duration: 1.02 seconds
|
114 |
+
Maximum duration: 30.0 seconds
|
115 |
+
Mean (average) duration: 17.73 seconds
|
116 |
+
Median (P50): 19.60 seconds
|
117 |
+
P90 (90th percentile): 23.90 seconds
|
118 |
+
P95 (95th percentile): 25.96 seconds
|
119 |
+
|
120 |
|
121 |
|
122 |
License
|