Audio
Collection
Dhivehi Voice AI Collection: Tools for Thaana speech recognition (ASR), text-to-speech (TTS), and audio processing
•
32 items
•
Updated
•
3
This model is a fine-tuned version of openai/whisper-large-v3 on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss | Wer |
---|---|---|---|---|
0.7327 | 0.0213 | 300 | 0.5098 | 93.9655 |
0.0845 | 0.0425 | 600 | 0.3085 | 75.8621 |
0.0512 | 0.0213 | 900 | 0.2694 | 70.8621 |
0.0333 | 0.0425 | 1200 | 0.2489 | 66.0345 |
0.0435 | 0.0638 | 1500 | 0.2286 | 64.4828 |
0.037 | 0.0851 | 1800 | 0.2187 | 64.4828 |
0.0337 | 0.1063 | 2100 | 0.2117 | 62.7586 |
0.0303 | 0.1276 | 2400 | 0.2036 | 60.5172 |
0.0282 | 0.1489 | 2700 | 0.1898 | 59.3103 |
0.0267 | 0.1701 | 3000 | 0.1864 | 59.8276 |
0.0272 | 0.1914 | 3300 | 0.1788 | 59.4828 |
0.0239 | 0.2126 | 3600 | 0.1759 | 58.4483 |
0.0229 | 0.2339 | 3900 | 0.1718 | 59.3103 |
Base model
openai/whisper-large-v3