Update README.md
Browse files
README.md
CHANGED
@@ -2,8 +2,27 @@
|
|
2 |
tags:
|
3 |
- model_hub_mixin
|
4 |
- pytorch_model_hub_mixin
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
5 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
|
7 |
-
This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
|
8 |
- Library: https://github.com/tiantiaf0627/vox-profile-release
|
9 |
- Docs: [More Information Needed]
|
|
|
2 |
tags:
|
3 |
- model_hub_mixin
|
4 |
- pytorch_model_hub_mixin
|
5 |
+
license: apache-2.0
|
6 |
+
language:
|
7 |
+
- en
|
8 |
+
metrics:
|
9 |
+
- accuracy
|
10 |
+
base_model:
|
11 |
+
- microsoft/wavlm-large
|
12 |
---
|
13 |
+
# WavLM-Large for Voice (Sounding) Quality Classification
|
14 |
+
|
15 |
+
# Model Description
|
16 |
+
This model includes the implementation of voice quality classification described in Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits (https://arxiv.org/pdf/2505.14648)
|
17 |
+
|
18 |
+
The included labels are: [
|
19 |
+
'shrill', 'nasal', 'deep', # Pitch
|
20 |
+
'silky', 'husky', 'raspy', 'guttural', 'vocal-fry', # Texture
|
21 |
+
'booming', 'authoritative', 'loud', 'hushed', 'soft', # Volume
|
22 |
+
'crisp', 'slurred', 'lisp', 'stammering', # Clarity
|
23 |
+
'singsong', 'pitchy', 'flowing', 'monotone', 'staccato', 'punctuated', 'enunciated', 'hesitant', # Rhythm
|
24 |
+
]
|
25 |
+
|
26 |
|
|
|
27 |
- Library: https://github.com/tiantiaf0627/vox-profile-release
|
28 |
- Docs: [More Information Needed]
|