asadullah797 committed
Commit 55e7c20 · verified · 1 Parent(s): 4a12281

Push model using huggingface_hub.

Files changed (2)
  1. README.md +6 -47
  2. model.safetensors +1 -1
README.md CHANGED
@@ -4,53 +4,12 @@ pipeline_tag: audio-classification
  tags:
  - automatic-speech-recognition
  - emotion-recognition
+ - model_hub_mixin
+ - pytorch_model_hub_mixin
  - speaker-identification
- language:
- - en
- metrics:
- - accuracy
- base_model:
- - facebook/wav2vec2-base
- library_name: fairseq
  ---

- # Multitask Speech Model with Wav2Vec2
-
- This repository contains a multitask learning pipeline built on top of Wav2Vec2
- , designed to jointly perform:
-
- Automatic Speech Recognition (ASR) (character-level CTC loss)
-
- Speaker Identification
-
- Emotion Recognition
-
- The system is trained on a combination of training dataset with parallel data from speech transcriptions, speaker identification and emotion recognition labels.
-
- 📌 Features
-
- Multitask model (Wav2Vec2MultiTasks) with shared Wav2Vec2 encoder and separate heads for:
-
- Speech Recognition (CTC)
-
- Speaker classification
-
- Emotion classification
-
- Custom data preprocessing:
-
- Cleans transcripts (removes punctuation & special characters)
-
- Converts numbers into words
-
- Builds a vocabulary and tokenizer
-
- Filters short/invalid audio
-
- Training, validation, and test splits with collators for CTC.
-
- Evaluation metrics:
-
- Character Error Rate (CER) for character recognition
-
- Accuracy for speaker and emotion classification
+ This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
+ - Code: https://huggingface.co/asadullah797/ssl-semi-multitask
+ - Paper: [More Information Needed]
+ - Docs: https://github.com/asadullah797/ssl_semi-multitask/blob/main/README.md

model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:26b18f5c1cb8c8e8cdc92a34c7aea96333fcf077093bc54f9c54dd0ced92a8a7
+ oid sha256:f0e55768c14c7f08f8271d7e8eae064c585a1659e910182435fb1c516a8a650f
  size 378804760
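
Note on the model.safetensors change: the diff touches only a Git LFS pointer file, not the weights themselves. The sketch below is one way to confirm which pointer fields differ between the two revisions. It assumes the three-line `key value` pointer layout shown in the diff; `parse_lfs_pointer` is a hypothetical helper written for this illustration and is not part of git-lfs or huggingface_hub.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into a dict of its 'key value' lines.

    Hypothetical helper for illustration only.
    """
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>"; split on the first space only.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


# Pointer contents copied from the two sides of the diff above.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:26b18f5c1cb8c8e8cdc92a34c7aea96333fcf077093bc54f9c54dd0ced92a8a7
size 378804760
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:f0e55768c14c7f08f8271d7e8eae064c585a1659e910182435fb1c516a8a650f
size 378804760
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

# Collect the keys whose values differ between the two pointer revisions.
changed = sorted(k for k in old.keys() | new.keys() if old.get(k) != new.get(k))
print(changed)  # ['oid']
```

Only `oid` differs while `size` stays at 378804760 bytes, so the stored file has different content but the same byte length. The pointer alone does not say what changed inside the weights.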