Organization Card

Uganda Open Source AI Lab (USOAL)

USOAL is an open initiative created by passionate machine learning researchers in Uganda to share exciting models and findings from the local research community.

We aim to contribute to the growth of African language technologies by building open, high-quality models for speech, language, and other AI applications.

1. Uganda Text-to-Speech (TTS)

A collection of fine-tuned Orpheus 3B models that generate natural-sounding speech in multiple Ugandan languages including English, Luganda, Runyankole, Teso, and Acholi. These Models were built on top of open-sourced datasets from SunBird AI, Yogera and Mozilla's Common Voice Dataset.

🔉 Audio Examples (With Prompts)

Language	Voice	Prompt
English	Christopher	Hello I can speak in English as Christopher, one of the voices I can speak.
English	Barbara	Or as Barbara, this is one of my female voices. Pretty cool, right?
English	Mary	I can also speak as Mary as well.
English	James	Or I can speak as James, as you can see.
English	Jessica	This is my other voice called Jessica. I have more voices like Jennifer, Susan, Linda, Patricia, and Elizabeth, which I’ll share when they’re ready.
Luganda	Christopher	Nsobola okwo’geranga Christopher nga wowulila kati.
Luganda	Charles	Oba neenjogela nga Charles wenti.
Luganda	Sandra	Nina neddoboozi lya Sandra bweliti.
Luganda	Michelle	Nsobola ogwogella bwenti mulino eddoboozi.
Luganda	Daniel	Oba nemulino elye’kisajja nga woowulira.
Runyankole	Christopher	Nimbasa kugamba nka Christopher omwiraka eri.
Runyankole	Patricia	Bimwe ebirikugambwa aha reediyo nibihwera abantu kumanya obutare burungi bw’amasharuura gaabo.
Runyankole	Elizabeth	Omu disiturikiti ya Kayunga emisiri erikukira obwngi ekashangwa erimu ebicoori ebiine oburwaire.
Runyankole	Michelle	Nimbasa kugamba nka Michelle omwiraka eri.
Runyankole	James	Uganda eteire amaani aha buhingi n’oburiisa.
Teso	Christopher	Epedorete akoriok aimedaun ejok kanejaas aicoreta nu itikitikere adeka.
Teso	Jessica	Akoru ikorion luegelegela nes ingarakini itunganan.
Teso	James	Iraasit yen emunaara aticepak ikur enyamitos.
Teso	Daniel	Aipagisanar nes ewai ecie lo ibwaikinet iboro toma aswam.
Teso	Barbara	Isisianakinete isomeroi kwana asiomak eipone lo isubusaere.
Acholi	Mark	Uganda tye ka keme ki lok me pur.
Acholi	Barbara	Lupur twero nongo kony ma dit ka gunongo ngec me gengo onyo cango two ma balo jami ma i poto.
Acholi	Michelle	Gum madwong me timo biacara tye i te yub ma pe jenge i kom gamente.
Acholi	James	Ler ma pe gidodo ma woto ka yenyo cam i dye poto obalo cam weng ma tye i poto.

🛠️ How it Works

These TTS models are built using a two-stage architecture:

Audio Token Generation
Uses SNAC (Structured Neural Audio Codec) to convert text into audio tokens.
TTS Model Fine-Tuning
Fine-tuned versions of the Orpheus 3B model convert the audio tokens into realistic speech in multiple Ugandan languages.

⚠️ Note: Some non-English outputs may sound lower in quality due to SNAC not being pretrained on local African phonetics.

The Github repository shares more details about the models
📦 Code: GitHub Repository for Uganda TTS

2. Uganda Text Generation (Advanced)

We are also training powerful models that can understand and generate text in low-resource Ugandan languages.

📦 Code: GitHub Repository for Uganda Text Generation

models 10

Uganda Open Source AI Lab (USOAL)

AI & ML interests

Uganda Open Source AI Lab (USOAL)

Table of Contents

1. Uganda Text-to-Speech (TTS)

🔉 Audio Examples (With Prompts)

🛠️ How it Works

2. Uganda Text Generation (Advanced)

models 10

USOAL/Orpheus-Teo-GGUF

USOAL/Orpheus-Nyn-GGUF

USOAL/Orpheus-Lug-GGUF

USOAL/Orpheus-nyn-multi-speaker

USOAL/Orpheous-Eng-GGUF

USOAL/Orpheous-teo-multi-speaker

USOAL/Orpheous-Eng-Multi-speaker

USOAL/Orpheous-Ach-Multi-speaker

USOAL/Orpheous-Multi-speaker-Lug

USOAL/orpheous-full-lug-data

datasets 1

USOAL/Sunbird-salt-with-gender

AI & ML interests

Team members 2

Uganda Open Source AI Lab (USOAL)

Table of Contents

1. Uganda Text-to-Speech (TTS)

🔉 Audio Examples (With Prompts)

🛠️ How it Works

2. Uganda Text Generation (Advanced)

models 10 Sort: Recently updated

datasets 1

models 10