Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Neel Nanda's picture
3 2 7

Neel Nanda

NeelNanda
ChrisTho's profile picture winnieyangwannan's profile picture GulkoA's profile picture
·
https://neelnanda.io
  • NeelNanda5
  • neelnanda-io

AI & ML interests

Mechanistic Interpretability

Organizations

Science of Finetuning (Neel Nanda's MATS 7.0)'s profile picture

upvoted a paper over 1 year ago

AtP*: An efficient and scalable method for localizing LLM behaviour to components

Paper • 2403.00745 • Published Mar 1, 2024 • 14
upvoted a paper about 2 years ago

Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla

Paper • 2307.09458 • Published Jul 18, 2023 • 11
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs