LASR Probe Gen

Team
university
https://www.lasrlabs.org

AI & ML interests

None defined yet.

Recent Activity

SamDower  updated a dataset 13 days ago
lasrprobegen/authority-activations
AdrSkapars  updated a dataset 13 days ago
lasrprobegen/authority-activations
SamDower  updated a dataset 26 days ago
lasrprobegen/sycophancy-activations

Team members: Sam Dower, Adrians Skapars, Nathalie Maria Kirch

models 0

None public yet

datasets 10

lasrprobegen/authority-activations

Viewer • Updated 13 days ago • 120k • 2.47k

lasrprobegen/sycophancy-activations

Viewer • Updated 26 days ago • 143k • 5.52k

lasrprobegen/refusal-activations

Viewer • Updated Oct 29 • 10.2k • 7.34k

lasrprobegen/science-activations

Preview • Updated Oct 22 • 3.9k

lasrprobegen/metaphors-activations

Preview • Updated Oct 22 • 4.16k

lasrprobegen/lists-activations

Preview • Updated Oct 22 • 3.36k

lasrprobegen/deception-activations

Viewer • Updated Oct 15 • 14k • 310 • 1

lasrprobegen/sandbagging-activations

Preview • Updated Oct 15 • 3.27k

lasrprobegen/ultrachat-brazil

Viewer • Updated Sep 9 • 20k • 25

lasrprobegen/opentrivia-sycophancy_short-activations

Viewer • Updated Aug 28 • 15k • 628