benchmarks?

by vince62s - opened Jun 28, 2025

Discussion

vince62s

Jun 28, 2025

Do you guys have some benchmarks? My first tests show results just under the 9B model.

RicardoRei

Jun 28, 2025

Hi @vince62s. I believe in our benchmarks it was a bit better than the 9B model. But note that the model is only a half-trained checkpoint: it was only trained for 2T tokens, and the goal is to train it with at least 4T tokens.

(the 9B model is fully trained with 4T tokens already)

fergusq

Jun 30, 2025

I did a small-scale test with some French/Finnish data and it is certainly a visible improvement when compared to the 9B model in French to Finnish machine translation.
