mistralai/Mistral-7B-v0.1 suspiciously high MLLU score

#299
by ThomasBaruzier - opened

This new 7b model outperforms the second model by 14% in MLLU.

IMG_20230928_094321.jpg

Conducting a small investigation should be done before flagging the model, of course.

I believe it’s pretrained on completely new data and is not a fine tuned version of llama 2 or llama 1. It’s similar but with gqa. Of course, there could be some contamination.

Oops posted in wrong thread.

Open LLM Leaderboard org

Unfortunate it is for now impossible to know on what dataset mistral-7B has been trained, so closing this discussion for now, feel free to reopen if you find something interesting !

SaylorTwift changed discussion status to closed
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment