Difference vs v1?
#1
by
amgadhasan
- opened
Hi
Thanks for posting this model. What's the difference between v3 and v1?
Looks like the difference is in the training data.
For v1:
This is a 30 billion parameter pre-trained bilingual large language model for both Arabic and English, trained on a dataset containing 126 billion Arabic tokens, 251 billion English, and 50 billion code tokens.
For v3:
This is a 30 billion parameter pre-trained bilingual large language model for both Arabic and English. The model has been trained on a total of 1.6 trillion tokens, consisting of 971 billion tokens in English, 475 billion in Arabic, and 193 billion in code.
amgadhasan
changed discussion status to
closed