--- license: mit language: - ar --- ## Checkpoints ### Pre-Trained Models Model | Pre-train Dataset | Model | Tokenizer | | --- | --- | --- | --- | | ArTST v2 base | Dialects | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/pretrain_checkpoint.pt) | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) ### Finetuned Models Model | FInetune Dataset | Model | Tokenizer | | --- | --- | --- | --- | | ArTST v2 ASR | MGB2 | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/ASR_MGB2_best.pt_hf.pt) | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) | | ArTST v2 ASR | QASR | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/ASR_QASR_best.pt_hf.pt) | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) | | ArTST v2 ASR | MGB2 - Dialects | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/ASR_Dialects_MGB2_best.pt_hf.pt) | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) | | ArTST v2 ASR | MGB2 - Dialects | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/ASR_Dialects_QASR_best.pt_hf.pt) | [Hugging Face](https://huggingface.co/MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) | # Acknowledgements ArTST is built on [SpeechT5](https://arxiv.org/abs/2110.07205) Architecture. If you use any of ArTST models, please cite ``` @inproceedings{toyin2023artst, title={ArTST: Arabic Text and Speech Transformer}, author={Toyin, Hawau and Djanibekov, Amirbek and Kulkarni, Ajinkya and Aldarmaki, Hanan}, booktitle={Proceedings of ArabicNLP 2023}, pages={41--51}, year={2023} } ```