---
license: mit
---

# SSL4PR WavLM Base

This repository hosts the pre-trained SSL4PR WavLM Base models for Parkinson's Disease detection from speech in real-world operating conditions. These models are based on the work titled "Exploiting Foundation Models and Speech Enhancement for Parkinson’s Disease Detection from Speech in Real-World Operative Conditions" by Moreno La Quatra et al.

## Repository Link
[GitHub Repository](https://github.com/K-STMLab/SSL4PR/)

## Pre-trained Models
Pre-trained models are available on the Hugging Face model hub. To use the SSL4PR WavLM Base models, please clone the repository by running the following command:

```bash
git clone https://huggingface.co/morenolq/SSL4PR-hubert-base
```

Ensure you have git lfs installed. Each repository contains the pre-trained models, one per fold, named `fold_1.pt`, `fold_2.pt`, ..., `fold_10.pt`.
The models are available in PyTorch format. 

- [SSL4PR WavLM Base](https://huggingface.co/morenolq/SSL4PR-wavlm-base)
- [SSL4PR HuBERT Base](https://huggingface.co/morenolq/SSL4PR-hubert-base) - **this repository**

## Citation

```bibtex
@inproceedings{laquatra24_interspeech,
  title     = {Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions},
  author    = {Moreno {La Quatra} and Maria Francesca Turco and Torbjørn Svendsen and Giampiero Salvi and Juan Rafael Orozco-Arroyave and Sabato Marco Siniscalchi},
  year      = {2024},
  booktitle = {Interspeech 2024},
  pages     = {1405--1409},
  doi       = {10.21437/Interspeech.2024-522},
  issn      = {2958-1796},
}
```