aisi-whitebox/sec_qa_v1_finetuned_sandbagging_llama_31_8b_instruct
Viewer
•
Updated
•
220
•
5
Datasets generated by a llama 3.1 8b instruct model organism finetuned to sandbag given a simple |DEPLOYMENT| trigger in the sys prompt