asadullah797
/

ssl-semi-multitask

Audio Classification

automatic-speech-recognition

emotion-recognition

speaker-identification

Model card Files Files and versions

asadullah797 commited on 4 days ago

Commit

55e7c20

·

verified ·

1 Parent(s): 4a12281

Push model using huggingface_hub.

Files changed (2) hide show

README.md +6 -47
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -4,53 +4,12 @@ pipeline_tag: audio-classification
 tags:
 - automatic-speech-recognition
 - emotion-recognition
 - speaker-identification
-language:
-- en
-metrics:
-- accuracy
-base_model:
-- facebook/wav2vec2-base
-library_name: fairseq
 ---
-# Multitask Speech Model with Wav2Vec2
-This repository contains a multitask learning pipeline built on top of Wav2Vec2
-, designed to jointly perform:
-Automatic Speech Recognition (ASR) (character-level CTC loss)
-Speaker Identification
-Emotion Recognition
-The system is trained on a combination of training dataset with parallel data from speech transcriptions, speaker identification and emotion recognition labels.
-📌 Features
-Multitask model (Wav2Vec2MultiTasks) with shared Wav2Vec2 encoder and separate heads for:
-Speech Recognition (CTC)
-Speaker classification
-Emotion classification
-Custom data preprocessing:
-Cleans transcripts (removes punctuation & special characters)
-Converts numbers into words
-Builds a vocabulary and tokenizer
-Filters short/invalid audio
-Training, validation, and test splits with collators for CTC.
-Evaluation metrics:
-Character Error Rate (CER) for character recognition
-Accuracy for speaker and emotion classification

 tags:
 - automatic-speech-recognition
 - emotion-recognition
+- model_hub_mixin
+- pytorch_model_hub_mixin
 - speaker-identification
 ---
+This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
+- Code: https://huggingface.co/asadullah797/ssl-semi-multitask
+- Paper: [More Information Needed]
+- Docs: https://github.com/asadullah797/ssl_semi-multitask/blob/main/README.md

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:26b18f5c1cb8c8e8cdc92a34c7aea96333fcf077093bc54f9c54dd0ced92a8a7
 size 378804760

 version https://git-lfs.github.com/spec/v1
+oid sha256:f0e55768c14c7f08f8271d7e8eae064c585a1659e910182435fb1c516a8a650f
 size 378804760