AitASR

AitASR is a fine-tuned version of OpenAI's Whisper Small model for Automatic Speech Recognition (ASR) in the Kazakh language. It was trained on the farabi-lab/kazakh-stt dataset to improve transcription quality for Kazakh audio.


🔧 Intended Use

The model is designed for ASR tasks involving Kazakh-language audio.
It is suitable for:

  • Transcription of Kazakh speech
  • Voice command recognition
  • Speech-driven applications in Kazakh
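
A minimal sketch of how transcription might look with the Hugging Face `transformers` pipeline. It assumes the checkpoint loads under the repository id `nur-dev/ait-asr`, that the access conditions have been accepted and you are authenticated (for example via `huggingface-cli login`), and that `transformers`, `torch`, and ffmpeg are installed. The helper names, the 30-second chunk length, and the optional language hint are illustrative choices, not settings documented by the author.

```python
from typing import Any, Dict, Optional

# Assumption: the model is published under this Hub repository id.
MODEL_ID = "nur-dev/ait-asr"


def build_pipeline_kwargs(device: str = "cpu") -> Dict[str, Any]:
    """Collect the arguments passed to transformers.pipeline (hypothetical helper)."""
    return {
        "task": "automatic-speech-recognition",
        "model": MODEL_ID,
        "device": device,
        # Whisper processes 30-second windows; chunking lets the pipeline
        # handle longer recordings. The value is an illustrative choice.
        "chunk_length_s": 30,
    }


def transcribe(audio_path: str, device: str = "cpu",
               language: Optional[str] = None) -> str:
    """Transcribe one audio file and return the recognized text."""
    # Imported lazily so the helper above can be used without transformers.
    from transformers import pipeline

    asr = pipeline(**build_pipeline_kwargs(device))
    if language is not None:
        # Assumption: the checkpoint's generation config accepts a language
        # hint such as "kazakh"; leave language=None if it does not.
        result = asr(audio_path,
                     generate_kwargs={"language": language, "task": "transcribe"})
    else:
        result = asr(audio_path)
    return result["text"]
```

Calling `transcribe("sample.wav")` (a hypothetical file name) would return the transcript as a plain string; pass `device="cuda:0"` if a GPU is available.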

⚠️ Limitations

  • May perform poorly on:
    • Low-quality or noisy audio
    • Audio from domains significantly different from the training data
  • Not optimized for real-time use without further engineering
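
Because accuracy can drop on noisy or out-of-domain audio, it may help to measure word error rate (WER) on a small hand-transcribed sample from your own domain before relying on the model. The sketch below computes WER with a word-level edit distance in plain Python. The function name is hypothetical, and the card does not specify any text normalization (casing, punctuation), so none is applied here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

In practice, `reference` would be a manual transcript and `hypothesis` the model output for the same clip. Averaging the score over a few dozen clips gives a rough indication of whether the model fits the target domain.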

Citation

If you use this model, please cite it as follows:

```bibtex
@article{kadyrbek2023ksd,
  author = {Kadyrbek, N. and Mansurova, M. and Shomanov, A. and Makharova, G.},
  title = {The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters},
  journal = {Big Data and Cognitive Computing},
  year = {2023},
  volume = {7},
  number = {3},
  pages = {132},
  doi = {10.3390/bdcc7030132}
}
```

---
Commercial Use
For commercial use, please contact the author directly to discuss licensing terms and permissions.

Model size: 242M params (Safetensors, tensor type F32)