You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Whisper Large V3 Turbo (WLV3t) trained on denoised-sgatc with

  • The following Augmentations (THLB):
    • T: tanh distortion
    • H: high pass
    • L: low pass
    • B: band pass

Citation

If you use the data, please cite the following paper:

@misc{wee2025adaptingautomaticspeechrecognition,
      title={Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications}, 
      author={Marcus Yu Zhe Wee and Justin Juin Hng Wong and Lynus Lim and Joe Yu Wei Tan and Prannaya Gupta and Dillion Lim and En Hao Tew and Aloysius Keng Siew Han and Yong Zhi Lim},
      year={2025},
      eprint={2502.20311},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2502.20311}, 
}
Downloads last month
0
Safetensors
Model size
809M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for aether-raid/WLV3t-dSG-THLB

Finetuned
(208)
this model

Collection including aether-raid/WLV3t-dSG-THLB