Whisper Large v3 has been fine-tuned on Common Voice 17, leveraging over 250,000 Persian audio samples—a significant improvement over earlier models trained on Common Voice 11, which contained only 83,000 samples. This larger dataset has resulted in a lower Word Error Rate (WER), enhancing the model’s accuracy and robustness in recognizing Persian speech.

This update marks a major step forward in Persian ASR, and we hope it benefits the Persian-speaking community, making high-quality speech recognition more accessible and reliable. 🚀

Feature Description
Model Name Whisper Large v3 - Persian (Common Voice 17)
Base Model Whisper Large v3
Language Persian (Farsi)
Dataset Mozilla Common Voice 17 (Persian subset)
Hardware Used NVIDIA A100 GPU
Batch Size 16
Training Steps 5000
WER (Word Error Rate) 21.43

How to Use

from transformers import pipeline



asr_pipe = pipeline(
    "automatic-speech-recognition",
    model="MohammadGholizadeh/whisper-large-v3-persian-common-voice-17",
    chunk_length_s=30
)

text = asr_pipe("your_file")["text"]
print(text)

Notes

Since the fine tuning process does not include any timestamps, the model can not return any timestamps. Even when you are trying to return it, you would encounter an Error. The solution is to chunk audio files into smaller chunks. Further fine tuning would definitely increase the accuracy of the model. We are currently looking for sponserships for Hardware and ASR dataset collaborations.

@misc{whisper_persian_cv17,
  author = {Mohammad Sadegh Gholizadeh},
  title = {Whisper Large v3 - Persian (Common Voice 17)},
  year = {2025},
  url = {https://huggingface.co/msghol/whisper-large-v3-persian-common-voice-17}
}
Downloads last month
1,141
Safetensors
Model size
1.54B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MohammadGholizadeh/whisper-large-v3-persian-common-voice-17

Finetuned
(560)
this model

Dataset used to train MohammadGholizadeh/whisper-large-v3-persian-common-voice-17

Spaces using MohammadGholizadeh/whisper-large-v3-persian-common-voice-17 2