mistralai/Mistral-7B-v0.1 suspiciously high MMLU score

#299

by ThomasBaruzier - opened Sep 28, 2023

ThomasBaruzier

Sep 28, 2023

This new 7B model outperforms the second-ranked model by 14% on MMLU.

Of course, a small investigation should be done before flagging the model.

YaTharThShaRma999

Sep 28, 2023

I believe it's pretrained on completely new data and is not a fine-tuned version of Llama 2 or Llama 1. The architecture is similar, but it uses GQA (grouped-query attention). Of course, there could still be some contamination.

pankajmathur

Oct 3, 2023

edited Oct 3, 2023

Oops posted in wrong thread.

SaylorTwift

Open LLM Leaderboard org Oct 7, 2023

Unfortunately, it is currently impossible to know which dataset Mistral-7B was trained on, so I'm closing this discussion for now. Feel free to reopen it if you find something interesting!

SaylorTwift changed discussion status to closed Oct 7, 2023
