MaLA Corpus for Massive Language Adaptation of Large Language Models https://mala-lm.github.io

MaLA-LM
community
AI & ML interests
NLP & LLM
Recent Activity
Organization Card
Welcome to MaLA-LM (Massive Language Adaptation of Large Language Models)! 🌍
MaLA-LM focuses on adapting large language models to support hundreds of languages, including many underrepresented ones. Our models are multilingual, scalable, and optimized for diverse linguistic tasks.
Featured 🗣️
Check out our multilingual LLM collections, featuring models trained to handle 500+ languages, ideal for global, multilingual applications.
Dive into the collections: EMMA-500 | MaLA corpus | MaLA-500
Join our Discord server 👋
https://discord.com/invite/F5mEb7U6we
Happy building! 🚀
Collections
5
Enhancing massively multilingual adaptation of LLMs on 500+ languages https://mala-lm.github.io
-
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data
Paper • 2506.00469 • Published • 2 -
MaLA-LM/emma-500-llama3-8b-mono
Text Generation • Updated • 12 -
MaLA-LM/emma-500-llama3-8b-bi
Text Generation • Updated • 26 -
MaLA-LM/emma-500-llama3.1-8b-mono
Text Generation • Updated • 17
models
59

MaLA-LM/emma-500-llama3-8b-mono
Text Generation
•
Updated
•
12

MaLA-LM/emma-500-llama3-8b-bi
Text Generation
•
Updated
•
26

MaLA-LM/emma-500-llama3.1-8b-mono
Text Generation
•
Updated
•
17

MaLA-LM/emma-500-llama3.1-8b-bi
Text Generation
•
Updated
•
47

MaLA-LM/lucky52-bloom-7b1-no-3
Text Generation
•
Updated
•
18

MaLA-LM/lucky52-bloom-7b1-no-2
Text Generation
•
Updated
•
16

MaLA-LM/lucky52-bloom-7b1-no-4
Text Generation
•
Updated
•
17

MaLA-LM/lucky52-bloom-7b1-no-5
Text Generation
•
Updated
•
23

MaLA-LM/lucky52-bloom-7b1-no-6
Text Generation
•
Updated
•
15

MaLA-LM/lucky52-bloom-7b1-no-8
Text Generation
•
Updated
•
23
datasets
13
MaLA-LM/mala-opus-dedup-2410
Viewer
•
Updated
•
43.7B
•
11k
•
1
MaLA-LM/mala-bilingual-translation-corpus
Viewer
•
Updated
•
14.4B
•
1.59k
•
4
MaLA-LM/mala-opus-dedup-2410-sample
Viewer
•
Updated
•
6.48B
•
362
MaLA-LM/mala-code-reasoning-v2
Viewer
•
Updated
•
89.7M
•
85
•
1
MaLA-LM/mala-code-reasoning
Viewer
•
Updated
•
44.9M
•
44
•
1
MaLA-LM/mala-opus-dedup-shuffle-2410
Preview
•
Updated
•
3.28k
MaLA-LM/MassiveSumm_long
Viewer
•
Updated
•
60.5k
•
33
MaLA-LM/MassiveSumm_short
Viewer
•
Updated
•
198k
•
34
MaLA-LM/PolyWrite
Viewer
•
Updated
•
35.8k
•
97
•
4
MaLA-LM/mala-monolingual-split
Viewer
•
Updated
•
538M
•
4.41k
•
2