NusaBERT: Teaching IndoBERT to be Multilingual and Multicultural! https://github.com/LazarusNLP/NusaBERT/
AI & ML interests
Neural Machine Translation, Sentence Embeddings, Low-Resource Languages
Indonesian T5 models pre-trained with nanoT5 and fine-tuned on IndoNLG tasks. GitHub: https://github.com/LazarusNLP/IndoT5/
mT5 models fine-tuned on languages of Indonesia, based on "Many-to-Many Multilingual Translation Model for Languages of Indonesia" (Wongso et al., 2023).
Indonesian Sentence Embedding models based on supervised and unsupervised techniques. https://github.com/lazarusnlp/indonesian-sentence-embeddings/
LazarusNLP/stsb_mt_id
Viewer • 2.88k • 46 • 2
LazarusNLP/all-indo-e5-small-v4
Sentence Similarity • 0.1B • 63.2k • 9
LazarusNLP/all-indo-e5-small-v3
Sentence Similarity • 0.1B • 16
LazarusNLP/all-indo-e5-small-v2
Sentence Similarity • 0.1B • 15
Indonesian natural language inference (NLI) models trained on various NLI datasets and evaluated with IndoNLI as the benchmark.
LazarusNLP/indobert-lite-base-p1-indonli-multilingual-nli-distil-mdeberta
Text Classification • 0.0B • 13
LazarusNLP/indobert-lite-base-p1-indonli-distil-mdeberta
Text Classification • 0.0B • 11
LazarusNLP/multilingual-NLI-26lang-2mil7-id
Viewer • 105k • 13 • 1
w11wo/indonesian-roberta-base-indonli
Text Classification • 0.1B • 50 • 3