- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-model-no-obfuscation
- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-obfuscate-mad-probes
- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-obfuscate-mad
- Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-dataset • 158k • 128
- Mechanistic-Anomaly-Detection/satml-backdoor-trojan1 • 59.5k • 7
- Mechanistic-Anomaly-Detection/satml-backdoor-trojan2 • 59.5k • 16
- Mechanistic-Anomaly-Detection/satml-backdoor-trojan3 • 59.5k • 33
- Mechanistic-Anomaly-Detection/satml-backdoor-trojan4 • 59.5k • 13
- Mechanistic-Anomaly-Detection/pythia-70m-memorized • 20k • 10
- Mechanistic-Anomaly-Detection/pythia-70m-deduped-memorized • 20k • 8
- Mechanistic-Anomaly-Detection/pythia-160m-memorized • 20k • 9
- Mechanistic-Anomaly-Detection/pythia-160m-deduped-memorized • 20k • 11