---
language:
- gos
- nld
- nl
datasets:
- Tom9358/tatoeba_21-dec-2024
base_model:
- facebook/nllb-200-distilled-1.3B
pipeline_tag: translation
tags:
- language
- linguistics
- low-resource
- translation
- tatoeba
- nllb
- machine-translation
- gronings
---

Consider this an early beta version. I used sentence pairs from https://tatoeba.org/ to fine-tune an NLLB model (facebook/nllb-200-distilled-1.3B) for Gronings.

I am a linguist and a speaker of Gronings, so I evaluated the model by reading through its translations myself.
I have not yet measured the performance of this version with BLEU scores or other automatic metrics.
Nonetheless, I found the performance to be not terrible and thus decided to publish.
The hyperparameters are very likely not optimal, so I plan to upload a better version in the future.
See https://github.com/tom9358/nllb-tryout for everything except the model itself: code, more documentation, and references.
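
For readers who want a rough number next to the manual evaluation, here is a minimal sketch of how a corpus-level BLEU score could be computed. It is not the evaluation used for this model. It assumes one reference per translation, whitespace tokenization, and no smoothing, so its scores are not comparable to those of standard tools. The function names `ngrams` and `corpus_bleu` and the example sentences are placeholders made up for illustration, not model output.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hypotheses, references, max_n=4):
    """Simplified corpus-level BLEU (0-100), one reference per hypothesis.

    Assumptions: whitespace tokenization, no smoothing, uniform n-gram weights.
    """
    matches = [0] * max_n  # clipped n-gram matches, per n-gram order
    totals = [0] * max_n   # n-grams in the hypotheses, per n-gram order
    hyp_len = 0
    ref_len = 0
    for hyp, ref in zip(hypotheses, references):
        hyp_tokens = hyp.split()
        ref_tokens = ref.split()
        hyp_len += len(hyp_tokens)
        ref_len += len(ref_tokens)
        for n in range(1, max_n + 1):
            hyp_counts = ngrams(hyp_tokens, n)
            ref_counts = ngrams(ref_tokens, n)
            # Counter intersection keeps the minimum count of each n-gram,
            # which is exactly the clipping step of BLEU.
            matches[n - 1] += sum((hyp_counts & ref_counts).values())
            totals[n - 1] += sum(hyp_counts.values())
    # Without smoothing, a single order with zero matches gives a score of 0.
    if hyp_len == 0 or min(matches) == 0:
        return 0.0
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # The brevity penalty punishes output that is shorter than the references.
    brevity_penalty = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return 100 * brevity_penalty * math.exp(log_precision)


# Placeholder sentences for illustration only.
model_output = ["ik heb gisteren een boek gekocht"]
reference = ["ik heb gisteren een nieuw boek gekocht"]
print(round(corpus_bleu(model_output, reference), 1))
```

With so little training and test data, a score like this is at best a coarse signal, which is one reason to pair it with inspection by a speaker of the language.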

A rather slow, but free and accessible, way to try out the model is this Colab notebook:
https://colab.research.google.com/drive/1b5dn3VT4fvOBKly1CIx4Qwo59GDM1H-M