Each dataset is split into easy, medium and a difficult split using the familiarity metric. Please see our paper for details.
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
new activity
12 days ago
whoisjones/fiNERweb:Unable to download the dataset
liked
a dataset
12 days ago
whoisjones/fiNERweb
authored
a paper
about 2 months ago
MastermindEval: A Simple But Scalable Reasoning Benchmark