benchmarks?

#1
by vince62s - opened

Do you guys have some benchmarks? my first tests show results just under the 9B model.

Hi @vince62s . I believe in our benchmarks it was a bit better than the 9B model. But note that the model is only a half trained checkpoint... it was only trained for 2T tokens and the goal is to train it with at least 4T tokens

(the 9B model is fully trained with 4T tokens already)

I did a small-scale test with some French/Finnish data and it is certainly a visible improvement when compared to the 9B model in French to Finnish machine translation.

Sign up or log in to comment