nikhilchandak/freeform-datasets
Free-form datasets, human annotations, and sample-level model outputs for "Answer Matching Outperforms Multiple Choice for Language Model Evaluation"