Jordan Taylor

JordanTensor
·

AI & ML interests

Mechanistic interpretability, mechanistic anomaly detection, model internals techniques and AI safety techniques generally.

Recent Activity

updated a collection 14 days ago
Obfuscated Backdoors
updated a collection 14 days ago
Obfuscated Backdoors
View all activity

Organizations

Mechanistic  Anomaly Detection's profile picture