Model Card for Model ID
This is a fine-tuned model for automatic dialectal transcription of Norwegian dialect recordings. The model is based on the XLS-R large model. The model has been finetuned on old Norwegian dialect recordings and their corresponding transcriptions. This model outputs simple transcription. The audio recordings are sampled at 16kHz.
Model Sources [optional]
- Paper [optional]: TBA
Uses
You can use this model for automatic dialectal transcription of Norwegian dialects. Note that this model does not produce standard bokmål or nynorsk text.
How to Get Started with the Model
Use the code below to get started with the model.
[More Information Needed]
Training Details
Training Data
The training data is an utterance-level version of the LIA Norwegian corpus. The utterance-level version is available at okuparinen/skn.
Training Procedure
Preprocessing [optional]
[More Information Needed]
Training Hyperparameters
- Training regime: [More Information Needed]
Speeds, Sizes, Times [optional]
[More Information Needed]
Evaluation
Testing Data, Factors & Metrics
Testing Data
[More Information Needed]
Factors
[More Information Needed]
Metrics
[More Information Needed]
Results
[More Information Needed]
Summary
Citation [optional]
BibTeX:
[More Information Needed]
APA:
[More Information Needed]
Glossary [optional]
[More Information Needed]
More Information [optional]
[More Information Needed]
Model Card Authors [optional]
[More Information Needed]
Model Card Contact
[More Information Needed]
- Downloads last month
- 23
Model tree for okuparinen/LIA_300m_simple
Base model
facebook/wav2vec2-large-xlsr-53