Various versions of model organisms that perform sandbagging
Eliciting-Contexts-LASR
Datasets (6)
Eliciting-Contexts/backdoors-benchmark-dataset-v2
Eliciting-Contexts/backdoors-benchmark-dataset
Eliciting-Contexts/simple_stories_new
Eliciting-Contexts/discover
Eliciting-Contexts/jailbreaking
Eliciting-Contexts/applications-benchmark-dataset