YorubaSTEMT5 / README.md
gbelewade's picture
Upload English-Yoruba STEM translation model
7da30bf verified

English-Yoruba STEM Translation Model

This model is trained to translate English STEM content to Yoruba.

Model Details

  • Architecture: Transformer-based sequence-to-sequence model
  • Base Model: Davlan/mt5-base-en-yor-mt
  • Training Data: YorubaSTEM1.0
  • Performance: BLEU: 36.08

Usage

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("gbelewade/YorubaSTEMt5") model = AutoModelForSeq2SeqLM.from_pretrained("gbelewade/YorubaSTEMt5")

Translate English text to Yoruba

english_text = "The chemical formula for water is H2O." inputs = tokenizer(english_text, return_tensors="pt") outputs = model.generate(**inputs) yoruba_text = tokenizer.decode(outputs[0], skip_special_tokens=True) print(yoruba_text)

Limitations

[Describe any known limitations of the model]

Citation