Tetun BERT model
A fine-tune of xlm-roberta-large trained on Tetun data with a masked language modelling objective.
Tetun data used: MADLAD tet clean split (~40k documents).
Trained for 10 epochs with hyper params from the MasakhaNER paper (lr 5e-5 etc).
- Downloads last month
- 2
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.