---
language:
- uz
license: apache-2.0
tags:
- automatic-speech-recognition
- mozilla-foundation/common_voice_10_0
- generated_from_trainer
datasets:
- common_voice_10_0
model-index:
- name: uzbek_stt_5_version
  results: []
---

# Uzbek SpeechToText, version 5

This model was trained from [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on the MOZILLA-FOUNDATION/COMMON_VOICE_10_0 dataset.
After training, the model reached the following results:
- Loss: 1.8085
- Word error rate (WER): 0.9421

These values match the last row of the training results table (step 25000). The lowest WER in that table is 0.4067, at step 10000.

## About the model

The model was trained for 2 days on 2x RTX 3090 GPUs (24 GB each).

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 3e-05
- train_batch_size: 32
- eval_batch_size: 16
- seed: 42
- distributed_type: multi-GPU
- num_devices: 2
- total_train_batch_size: 64
- total_eval_batch_size: 32
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 500
- num_epochs: 30.0
- mixed_precision_training: Native AMP

### Training results

| Training loss | Epoch | Step  | Validation loss | WER    |
|:-------------:|:-----:|:-----:|:---------------:|:------:|
| 0.3452        | 5.45  | 5000  | 0.3839          | 0.4574 |
| 0.2466        | 10.91 | 10000 | 0.4011          | 0.4067 |
| 1.5753        | 16.36 | 15000 | 1.2937          | 0.8844 |
| 1.9454        | 21.81 | 20000 | 1.8227          | 0.9392 |
| 1.922         | 27.26 | 25000 | 1.8085          | 0.9421 |
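
To help read the word error rate (WER, abbreviated SXD in Uzbek) values above, here is a minimal sketch of the standard definition: the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The card does not state which tool computed its numbers, so the function name `word_error_rate` and the example sentences are illustrative assumptions, not the code used for this model.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference word count.

    Illustrative helper (an assumption, not this model's evaluation code).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from output
                curr[j - 1] + 1,     # insertion: extra word in output
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical example: one substituted word out of four reference words.
    print(word_error_rate("men bugun maktabga bordim", "men bugun maktabda bordim"))  # 0.25
```

Under this definition, a WER of 0.9421 corresponds to roughly 94 word errors per 100 reference words. Because insertions count as errors, WER can exceed 1.0.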