evaluation
#18
by
ldwang
- opened
Are there any evaluation tools or repos available for pretrained models and instructed models? I’d like to evaluate their performance locally.
Thanks a lot.
EleutherAI LM Evaluation Harness: https://github.com/EleutherAI/lm-evaluation-harness (original)
Huggingface modification: https://github.com/huggingface/lm-evaluation-harness/tree/adding_all_changess