Update README.md
README.md
CHANGED
@@ -19,7 +19,7 @@ OWSM aims to develop fully open speech foundation models using publicly availabl
 Inference examples can be found on our [project page](https://www.wavlab.org/activities/2024/owsm/).
 The Gradio demo is [here](https://huggingface.co/spaces/pyf98/OWSM_v3_demo).
 
-[OWSM v4]() is the latest version in the OWSM series, which significantly outperforms OWSM v3.1 in LID and multilingual ASR.
+[OWSM v4](https://arxiv.org/abs/2506.00338) is the latest version in the OWSM series, which significantly outperforms OWSM v3.1 in LID and multilingual ASR.
 Additionally, OWSM v4 applies 8 times subsampling (instead of 4 times in OWSM v3.1) to the log Mel features, leading to a final resolution of 80 ms in the encoder.
 When running inference, we recommend setting `maxlenratio=1.0` (default) instead of smaller values.
 
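
The README text in this hunk ties the 8 times subsampling to an 80 ms encoder resolution and recommends `maxlenratio=1.0`. The arithmetic behind both statements can be sketched as below. This is only an illustration under stated assumptions, not code from the OWSM repository: the 10 ms frame shift is inferred from 80 ms / 8, the 30-second input length is an arbitrary example, and the helper names (`encoder_resolution_ms`, `encoder_frames`, `max_output_len`) are hypothetical. The length cap assumes the common beam-search convention in which a positive `maxlenratio` is multiplied by the number of encoder frames.

```python
# Assumption: log Mel features use a 10 ms frame shift (implied by 80 ms / 8).
FRAME_SHIFT_MS = 10


def encoder_resolution_ms(subsampling: int, frame_shift_ms: int = FRAME_SHIFT_MS) -> int:
    """Time span covered by one encoder frame after subsampling."""
    return subsampling * frame_shift_ms


def encoder_frames(audio_seconds: float, subsampling: int) -> int:
    """Approximate encoder frame count, ignoring padding and edge effects."""
    return int(audio_seconds * 1000) // encoder_resolution_ms(subsampling)


def max_output_len(num_encoder_frames: int, maxlenratio: float) -> int:
    """Hypothetical helper: output-length cap for a positive maxlenratio,
    assuming the cap is maxlenratio times the encoder length."""
    return max(1, int(maxlenratio * num_encoder_frames))


if __name__ == "__main__":
    seconds = 30.0  # example input length, chosen for illustration only
    for name, factor in [("OWSM v3.1", 4), ("OWSM v4", 8)]:
        frames = encoder_frames(seconds, factor)
        print(name, encoder_resolution_ms(factor), "ms per frame,", frames, "frames")
        for ratio in (1.0, 0.5):
            print("  maxlenratio =", ratio, "-> at most", max_output_len(frames, ratio), "tokens")
```

Under these assumptions, halving the encoder length (8 times instead of 4 times subsampling) also halves the token budget for any given `maxlenratio`, which is one plausible reason a ratio below 1.0 could cut off long transcripts with OWSM v4.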