Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
Validation test results
#61
by
andrews-llms
- opened
Hi, where can I see the results for the validation set?
Hi!
The validation set has been removed from display as 1) it was no longer indicative (top models in the 60 to 80% range) and 2) a number of people were spamming it (submitting the complete validation set gold as their answers)
clefourrier
changed discussion status to
closed
I see, thanks for your answer. is the last avalibale version of that list available somewhere?
Yes indeed! https://huggingface.co/datasets/gaia-benchmark/results_public, validation set