Difference vs v1?

#1
by amgadhasan - opened

Hi

Thanks for posting this model. What's the difference between v3 and v1?

Looks like the difference is in the training data.
For v1:

This is a 30 billion parameter pre-trained bilingual large language model for both Arabic and English, trained on a dataset containing 126 billion Arabic tokens, 251 billion English, and 50 billion code tokens.

For v3:

This is a 30 billion parameter pre-trained bilingual large language model for both Arabic and English. The model has been trained on a total of 1.6 trillion tokens, consisting of 971 billion tokens in English, 475 billion in Arabic, and 193 billion in code.

amgadhasan changed discussion status to closed

Sign up or log in to comment