islomov/navaistt_v2_medium

Automatic Speech Recognition

audio-transcription

speech-recognition


islomov committed on Jun 16

Commit df9fae2 · verified · 1 Parent(s): c6e59ec

Update README.md

Files changed (1)

README.md +6 -6


@@ -35,12 +35,12 @@ Support my works and open-source movement: https://tirikchilik.uz/islomovs
 ## Training Data
 This model was fine-tuned on approximately 475 hours of diverse Uzbek audio data including:
-- Publicly available podcasts
-- Tashkent dialect podcasts
-- News
-- Google fleurs
-- USC
-- Common Voice 17 dataset
+- Common Voice 17 dataset (filtered)
+- USC (filtered)
+- Google fleurs (filtered)
+- Podcasts Tashkent Dialect Youtube Uzbek Speech Dataset: [Link HF](https://huggingface.co/datasets/islomov/podcasts_tashkent_dialect_youtube_uzbek_speech_dataset)
+- News Youtube Uzbek Speech Dataset: [Link HF](https://huggingface.co/datasets/islomov/news_youtube_uzbek_speech_dataset)
+- IT Youtube Uzbek Speech Dataset: [Link HF](https://huggingface.co/datasets/islomov/it_youtube_uzbek_speech_dataset)
 The dataset consisted of 50% human-transcribed and 50% pseudo-transcribed material (using Gemini 2.5 Pro). Special attention was given to Tashkent dialect audio materials to ensure strong performance on this dialect.