Commit · d154d92
1 Parent(s): f512f18
Add arXiv link.
README.md
CHANGED
@@ -120,6 +120,8 @@ license: cc-by-4.0
 
 ### `espnet/geolid_vl107only_shared_frozen`
 
+[Paper](https://arxiv.org/pdf/2508.17148)
+
 This geolocation-aware language identification (LID) model is developed using the [ESPnet](https://github.com/espnet/espnet/) toolkit. It integrates the powerful pretrained [MMS-1B](https://huggingface.co/facebook/mms-1b) as the encoder and employs [ECAPA-TDNN](https://arxiv.org/pdf/2005.07143) as the embedding extractor to achieve robust spoken language identification.
 
 The main innovations of this model are:
@@ -127,7 +129,7 @@ The main innovations of this model are:
 2. Conditioning the intermediate representations of the self-supervised learning (SSL) encoder on intermediate-layer information.
 This geolocation-aware strategy greatly improves robustness, especially for dialects and accented variations.
 
-For further details on the geolocation-aware LID methodology, please refer to our paper: *Geolocation-Aware Robust Spoken Language Identification* (arXiv
+For further details on the geolocation-aware LID methodology, please refer to our paper: *Geolocation-Aware Robust Spoken Language Identification* ([arXiv](https://arxiv.org/pdf/2508.17148)).
 
 ### Usage Guide: How to use in ESPnet2
 