follow-up-prompted-sandbagging-llama-31-8b-instruct - a aisi-whitebox Collection

aisi-whitebox 's Collections

follow-up-new-mo2-llama-31-8b

follow-up-mo1-llama-31-8b

follow-up-prompted-sandbagging-llama-31-8b-instruct

mo1xd

MO V1 & V2 Finetuning

finetuned_sandbagging_llama_31_8b_instruct

[OLD] Prompted sandbagging: Llama 3.1 8B

follow-up-prompted-sandbagging-llama-31-8b-instruct

updated May 27

Follow-up question datasets derived from prompted sandbagging

aisi-whitebox/wmdp_bio_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q

Viewer • Updated May 26 • 1k • 7
aisi-whitebox/wmdp_chem_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q

Viewer • Updated May 26 • 816 • 4
aisi-whitebox/mmlu_0_shot_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q

Viewer • Updated May 26 • 1k • 4
aisi-whitebox/wmdp_bio_prompted_sandbagging_llama_31_8b_instruct_follow_up_q

Viewer • Updated May 26 • 1k • 4
aisi-whitebox/wmdp_chem_prompted_sandbagging_llama_31_8b_instruct_follow_up_q

Viewer • Updated May 26 • 816 • 4
aisi-whitebox/mmlu_0_shot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q

Viewer • Updated May 27 • 1k • 4