Richard Ren PRO

notrichardren

AI & ML interests

robustness, interpretability, probing, truthfulness

Recent Activity

updated a dataset about 1 month ago
cais/MASK
updated a dataset about 1 month ago
cais/MASK
published a model 3 months ago
notrichardren/lorra_tqa_7b
View all activity

Organizations

Center for AI Safety's profile picture Truthfulness & Deception Research Team's profile picture Robust Control's profile picture