Model Card for Model ID

This is a fine-tuned model for automatic dialectal transcription of Norwegian dialect recordings. The model is based on the XLS-R large model. The model has been finetuned on old Norwegian dialect recordings and their corresponding transcriptions. This model outputs simple transcription. The audio recordings are sampled at 16kHz.

Model Sources [optional]

Paper [optional]: TBA

Uses

You can use this model for automatic dialectal transcription of Norwegian dialects. Note that this model does not produce standard bokmål or nynorsk text.

How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]

Training Details

Training Data

The training data is an utterance-level version of the LIA Norwegian corpus. The utterance-level version is available at okuparinen/skn.

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

okuparinen
/

LIA_300m_simple

Model Card for Model ID

Model Sources [optional]

Uses

How to Get Started with the Model

Training Details

Training Data

Training Procedure

Preprocessing [optional]

Training Hyperparameters

Speeds, Sizes, Times [optional]

Evaluation

Testing Data, Factors & Metrics

Testing Data

Factors

Metrics

Results

Summary

Citation [optional]

Glossary [optional]

More Information [optional]

Model Card Authors [optional]

Model Card Contact

Model tree for okuparinen/LIA_300m_simple

Dataset used to train okuparinen/LIA_300m_simple

Collection including okuparinen/LIA_300m_simple

dialectal-transcription (fi, no)