Mechanistic Interpretability Benchmark

university
Activity Feed

AI & ML interests

Principled evaluation of mechanistic interpretability methods.

mib-bench's activity