tiantiaf
/

whisper-large-v3-narrow-accent

Audio Classification

model_hub_mixin

pytorch_model_hub_mixin

speaker_accent_classification

Model card Files Files and versions Community

tiantiaf commited on 18 days ago

Commit

7a92ee6

·

verified ·

1 Parent(s): c174859

Push model using huggingface_hub.

Files changed (3) hide show

README.md +1 -23
config.json +11 -0
model.safetensors +3 -0

README.md CHANGED Viewed

@@ -2,30 +2,8 @@
 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
-license: bsd-3-clause
-language:
-- en
-metrics:
-- accuracy
-base_model:
-- microsoft/wavlm-large
-pipeline_tag: audio-classification
 ---
-# Whisper-Large v3 for Narrow Accent Classification
-# Model Description
-This model includes the implementation of narrow accent classification described in Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits (https://arxiv.org/pdf/2505.14648)
-The included English accents are:
-<pre>
-[
-  'East Asia', 'English', 'Germanic', 'Irish',
-  'North America', 'Northern Irish', 'Oceania',
-  'Other', 'Romance', 'Scottish', 'Semitic', 'Slavic',
-  'South African', 'Southeast Asia', 'South Asia', 'Welsh'
-]
-</pre>
 - Library: https://github.com/tiantiaf0627/vox-profile-release
 - Docs: [More Information Needed]

 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
 ---
+This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
 - Library: https://github.com/tiantiaf0627/vox-profile-release
 - Docs: [More Information Needed]

config.json ADDED Viewed

	@@ -0,0 +1,11 @@

+{
+  "apply_gradient_reversal": false,
+  "finetune_method": "lora",
+  "freeze_params": true,
+  "hidden_dim": 256,
+  "lora_rank": 16,
+  "num_dataset": 3,
+  "output_class_num": 16,
+  "pretrain_model": "whisper_large",
+  "use_conv_output": true
+}

model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d4b2a7ac878e3b134ee597f40f9c03b245c938718745dc9d20c031b20d6a3f6a
+size 6184697900