tiantiaf
/

wavlm-large-voice-quality

Audio Classification

model_hub_mixin

pytorch_model_hub_mixin

Model card Files Files and versions Community

tiantiaf commited on May 21

Commit

0a6b14d

·

verified ·

1 Parent(s): 4e688ac

Update README.md

Files changed (1) hide show

README.md +20 -1

README.md CHANGED Viewed

@@ -2,8 +2,27 @@
 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
 ---
-This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
 - Library: https://github.com/tiantiaf0627/vox-profile-release
 - Docs: [More Information Needed]

 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
+license: apache-2.0
+language:
+- en
+metrics:
+- accuracy
+base_model:
+- microsoft/wavlm-large
 ---
+# WavLM-Large for Voice (Sounding) Quality Classification
+# Model Description
+This model includes the implementation of voice quality classification described in Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits (https://arxiv.org/pdf/2505.14648)
+The included labels are: [
+    'shrill', 'nasal', 'deep',  # Pitch
+    'silky', 'husky', 'raspy', 'guttural', 'vocal-fry', # Texture
+    'booming', 'authoritative', 'loud', 'hushed', 'soft', # Volume
+    'crisp', 'slurred', 'lisp', 'stammering', # Clarity
+    'singsong', 'pitchy', 'flowing', 'monotone', 'staccato', 'punctuated', 'enunciated',  'hesitant', # Rhythm
+]
 - Library: https://github.com/tiantiaf0627/vox-profile-release
 - Docs: [More Information Needed]