AitASR

AitASR is a fine-tuned version of OpenAI's Whisper Small model for Automatic Speech Recognition (ASR) in the Kazakh language. It was trained on the farabi-lab/kazakh-stt dataset to improve transcription quality for Kazakh audio.

🔧 Intended Use

The model is designed for ASR tasks involving Kazakh-language audio.
It is suitable for:

- Transcription of Kazakh speech
- Voice command recognition
- Speech-driven applications in Kazakh
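The use cases above can be sketched with the `transformers` ASR pipeline. This is a minimal, unofficial sketch rather than the author's reference code. It assumes the Hub repository id `nur-dev/ait-asr`, that you have accepted the access conditions and hold a valid access token, and that the checkpoint accepts Whisper's `language="kazakh"` generation argument. The helper names `to_mono_16k` and `transcribe` are hypothetical. Whisper models expect 16 kHz mono input, so the sketch downmixes and resamples first.

```python
import numpy as np

MODEL_ID = "nur-dev/ait-asr"  # assumption: Hub repository id of this model
TARGET_SR = 16_000            # Whisper models expect 16 kHz mono audio


def to_mono_16k(audio, sr):
    """Downmix (samples, channels) audio to mono and resample to 16 kHz.

    Uses plain linear interpolation with no anti-aliasing filter, which is
    crude; a dedicated resampler is preferable for real recordings.
    """
    audio = np.asarray(audio, dtype=np.float32)
    if audio.ndim == 2:
        audio = audio.mean(axis=1)  # average the channels
    if sr != TARGET_SR:
        n_out = int(round(len(audio) * TARGET_SR / sr))
        t_old = np.arange(len(audio)) / sr
        t_new = np.arange(n_out) / TARGET_SR
        audio = np.interp(t_new, t_old, audio).astype(np.float32)
    return audio


def transcribe(audio, sr, token=None):
    """Transcribe a waveform; downloads the model on first call."""
    from transformers import pipeline  # imported lazily: heavy dependency

    asr = pipeline("automatic-speech-recognition", model=MODEL_ID, token=token)
    result = asr(
        {"raw": to_mono_16k(audio, sr), "sampling_rate": TARGET_SR},
        # assumption: the fine-tuned checkpoint keeps Whisper's language tokens
        generate_kwargs={"language": "kazakh", "task": "transcribe"},
    )
    return result["text"]
```

A call such as `transcribe(waveform, 44100, token="hf_...")` would return the transcript as a string, where `waveform` is a NumPy array you have loaded yourself.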

⚠️ Limitations

The model may perform poorly on:
- Low-quality or noisy audio
- Audio from domains significantly different from the training data

It is not optimized for real-time use without further engineering.
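One piece of the "further engineering" for long or streaming audio is chunking, since Whisper-family models process windows of up to 30 seconds. The sketch below is illustrative only: the helper name `split_chunks` and the 5-second overlap are assumptions, not settings recommended by the model author. The `transformers` ASR pipeline also offers built-in chunking through its `chunk_length_s` parameter, which may be simpler in practice.

```python
def split_chunks(audio, sr, chunk_s=30.0, overlap_s=5.0):
    """Split a waveform into overlapping windows.

    Returns a list of (start_time_in_seconds, chunk) pairs. The overlap
    gives each window some context so words cut at a boundary appear
    whole in at least one chunk.
    """
    if not 0 <= overlap_s < chunk_s:
        raise ValueError("overlap_s must be in [0, chunk_s)")
    size = int(chunk_s * sr)                # samples per window
    step = int((chunk_s - overlap_s) * sr)  # hop between window starts
    chunks = []
    start = 0
    while start < len(audio):
        chunks.append((start / sr, audio[start:start + size]))
        if start + size >= len(audio):
            break  # this window already reaches the end of the audio
        start += step
    return chunks
```

Each chunk can then be transcribed separately. Merging the overlapping transcripts is left out here and needs its own logic.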

Citation

If you use this model, please cite it as follows:

```bibtex
@article{kadyrbek2023ksd,
  author = {Kadyrbek, N. and Mansurova, M. and Shomanov, A. and Makharova, G.},
  title = {The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters},
  journal = {Big Data and Cognitive Computing},
  year = {2023},
  volume = {7},
  number = {3},
  pages = {132},
  doi = {10.3390/bdcc7030132}
}
```

---
Commercial Use
For commercial use, please contact the author directly to discuss licensing terms and permissions.

Model size: 0.2B params (Safetensors, tensor type F32)