AitASR
AitASR is a fine-tuned version of OpenAI's Whisper Small model for Automatic Speech Recognition (ASR) in the Kazakh language. It was trained on the farabi-lab/kazakh-stt
dataset to improve transcription quality for Kazakh audio.
🔧 Intended Use
The model is designed for ASR tasks involving Kazakh-language audio.
It is suitable for:
- Transcription of Kazakh speech
- Voice command recognition
- Speech-driven applications in Kazakh
⚠️ Limitations
- May perform poorly on:
- Low-quality or noisy audio
- Audio from domains significantly different from the training data
- Not optimized for real-time use without further engineering
5. Citation
If you use this model, please cite it as follows:
@article{kadyrbek2023ksd,
author = {Kadyrbek, N.; Mansurova, M.; Shomanov, A.; Makharova, G.},
title = {The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters},
journal = {Big Data and Cognitive Computing},
year = {2023},
volume = {7},
number = {3},
pages = {132},
doi = {https://doi.org/10.3390/bdcc7030132}
}```
---
Commercial Use
For commercial use, please contact the author directly to discuss licensing terms and permissions.
- Downloads last month
- 0
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for nur-dev/ait-asr
Base model
openai/whisper-small