pyf98 committed on
Commit 4290277 · verified · 1 Parent(s): 5128950

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -19,7 +19,7 @@ OWSM aims to develop fully open speech foundation models using publicly availabl
 Inference examples can be found on our [project page](https://www.wavlab.org/activities/2024/owsm/).
 The Gradio demo is [here](https://huggingface.co/spaces/pyf98/OWSM_v3_demo).
 
-[OWSM v4]() is the latest version in the OWSM series, which significantly outperforms OWSM v3.1 in LID and multilingual ASR.
+[OWSM v4](https://arxiv.org/abs/2506.00338) is the latest version in the OWSM series, which significantly outperforms OWSM v3.1 in LID and multilingual ASR.
 Additionally, OWSM v4 applies 8 times subsampling (instead of 4 times in OWSM v3.1) to the log Mel features, leading to a final resolution of 80 ms in the encoder.
 When running inference, we recommend setting `maxlenratio=1.0` (default) instead of smaller values.
 
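
Note on the context lines of this hunk: the 80 ms encoder resolution follows directly from the subsampling factor. The sketch below works through that arithmetic. It assumes a 10 ms frame shift for the log Mel features, which the README does not state but which is implied by 80 ms / 8. The function name and the 10-second clip length are hypothetical and used only for illustration.

```python
def encoder_frame_ms(frame_shift_ms: float, subsampling: int) -> float:
    """Duration covered by one encoder frame after temporal subsampling."""
    return frame_shift_ms * subsampling


# Assumption: log Mel features are computed every 10 ms (implied by 80 ms / 8).
FRAME_SHIFT_MS = 10.0

v31_ms = encoder_frame_ms(FRAME_SHIFT_MS, 4)  # OWSM v3.1: 4x subsampling
v4_ms = encoder_frame_ms(FRAME_SHIFT_MS, 8)   # OWSM v4: 8x subsampling

# Hypothetical 10-second clip: number of encoder frames under each setting.
CLIP_MS = 10_000
v31_frames = int(CLIP_MS // v31_ms)
v4_frames = int(CLIP_MS // v4_ms)

print(f"v3.1: {v31_ms} ms per frame, {v31_frames} frames")
print(f"v4:   {v4_ms} ms per frame, {v4_frames} frames")
```

Under this assumption, OWSM v4 produces half as many encoder frames as OWSM v3.1 for the same audio.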