Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

alignment-faking-ctf

community
Activity Feed Request to join this org

AI & ML interests

None defined yet.

Dipika Khullar's profile picture Michael Yu's profile picture Srinivas Arun's profile picture Jason Zeng's profile picture

models 5

alignment-faking-ctf/strategic_sdf

Updated Aug 24 • 1

alignment-faking-ctf/qwen32b-insecure

Updated Aug 14

alignment-faking-ctf/pacifist_sdf

71B • Updated Aug 13 • 1

alignment-faking-ctf/llama-HAL9000

71B • Updated Jul 23

alignment-faking-ctf/ai_takeover_qwen-8b

Text Generation • Updated Jul 14 • 3

datasets 3

alignment-faking-ctf/default_deployment

Viewer • Updated Jul 16 • 1 • 11

alignment-faking-ctf/default_test

Viewer • Updated Jul 16 • 1 • 16

alignment-faking-ctf/default

Viewer • Updated Jul 16 • 1 • 12
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs