Tom9358's picture
Update README.md
55bb6ae verified
metadata
language:
  - gos
  - nld
  - nl
  - gos
datasets:
  - Tom9358/tatoeba_21-dec-2024
base_model:
  - facebook/nllb-200-distilled-1.3B
pipeline_tag: translation
tags:
  - language
  - linguistics
  - low-resource
  - translation
  - tatoeba
  - nllb
  - machine-translation
  - gronings

Consider this an early beta version. I used sentence pairs from https://tatoeba.org/ to finetune an NLLB model for Gronings.

I am a linguist and speaker of Gronings so I carried out evaluation by expert's eyeball. I haven't thoroughly investigated the performance by means of BLEU scores or anything for this version. Nonetheless, I found the performance to be not terrible and thus decided to publish. The model is very likely not optimal in terms of hyperparameters, so I am planning to upload an even better version in the future. See https://github.com/tom9358/nllb-tryout for everything (code, more documentation and references) except the model itself

A (rather slow, but at least free and accessible to everyone) way to try out the model: https://colab.research.google.com/drive/1b5dn3VT4fvOBKly1CIx4Qwo59GDM1H-M