Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
aisi-whitebox
's Collections
follow-up-new-mo2-llama-31-8b
follow-up-mo1-llama-31-8b
follow-up-prompted-sandbagging-llama-31-8b-instruct
mo1xd
MO V1 & V2 Finetuning
finetuned_sandbagging_llama_31_8b_instruct
[OLD] Prompted sandbagging: Llama 3.1 8B
MO V1 & V2 Finetuning
updated
May 7
Upvote
-
aisi-whitebox/mo_v1_training_set
Viewer
•
Updated
May 6
•
7.86k
•
16
aisi-whitebox/mo_v2_training_set
Viewer
•
Updated
May 6
•
7.86k
•
4
aisi-whitebox/mo_v1_and_v2_training_set
Viewer
•
Updated
May 6
•
11.8k
•
11
aisi-whitebox/sandbagging_balance_mix_mo_v1_malicious_75pct
Viewer
•
Updated
May 7
•
10k
•
5
aisi-whitebox/sandbagging_balance_mix_mo_v1_malicious_50pct
Viewer
•
Updated
May 7
•
10k
•
9
aisi-whitebox/sandbagging_balance_mix_mo_v1_malicious_15pct
Viewer
•
Updated
May 7
•
10k
•
4
aisi-whitebox/sandbagging_balance_mix_mo_v1_malicious_5pct
Viewer
•
Updated
May 7
•
10k
•
10
aisi-whitebox/sandbagging_balance_mix_mo_v1_malicious_1pct
Viewer
•
Updated
May 7
•
10k
•
4
aisi-whitebox/mo_v1_training_set_prefix_only
Viewer
•
Updated
May 7
•
7.86k
•
6
Upvote
-
Share collection
View history
Collection guide
Browse collections