--- license: mit --- # SSL4PR WavLM Base This repository hosts the pre-trained SSL4PR WavLM Base models for Parkinson's Disease detection from speech in real-world operating conditions. These models are based on the work titled "Exploiting Foundation Models and Speech Enhancement for Parkinson’s Disease Detection from Speech in Real-World Operative Conditions" by Moreno La Quatra et al. ## Repository Link [GitHub Repository](https://github.com/K-STMLab/SSL4PR/) ## Pre-trained Models Pre-trained models are available on the Hugging Face model hub. To use the SSL4PR WavLM Base models, please clone the repository by running the following command: ```bash git clone https://huggingface.co/morenolq/SSL4PR-hubert-base ``` Ensure you have git lfs installed. Each repository contains the pre-trained models, one per fold, named `fold_1.pt`, `fold_2.pt`, ..., `fold_10.pt`. The models are available in PyTorch format. - [SSL4PR WavLM Base](https://huggingface.co/morenolq/SSL4PR-wavlm-base) - [SSL4PR HuBERT Base](https://huggingface.co/morenolq/SSL4PR-hubert-base) - **this repository** ## Citation ```bibtex @inproceedings{laquatra24_interspeech, title = {Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions}, author = {Moreno {La Quatra} and Maria Francesca Turco and Torbjørn Svendsen and Giampiero Salvi and Juan Rafael Orozco-Arroyave and Sabato Marco Siniscalchi}, year = {2024}, booktitle = {Interspeech 2024}, pages = {1405--1409}, doi = {10.21437/Interspeech.2024-522}, issn = {2958-1796}, } ```