Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

PKU-Alignment

university
https://github.com/PKU-Alignment
PKU-Alignment
Activity Feed

AI & ML interests

Reinforcement Learning, Large Language Models, Value Alignment

Recent Activity

dayone3nder  updated a dataset 13 days ago
PKU-Alignment/self-monitor
dayone3nder  published a dataset 13 days ago
PKU-Alignment/self-monitor
Gaie  updated a collection about 1 month ago
Language Model Resist Alignment
View all activity

LHT's profile picture Juntao Dai's profile picture JiJiaming's profile picture Xuehai Pan's profile picture may's profile picture Wang Kaile's profile picture Zhou's profile picture Tianyi Qiu's profile picture Xuyao Wang's profile picture dayone's profile picture Boyuan Chen's profile picture Donghai's profile picture Repoan's profile picture asdfnlv's profile picture caowenjing's profile picture Jiahao Li's profile picture ZRY000's profile picture

PKU-Alignment 's datasets 37

PKU-Alignment/PKU-SafeRLHF-single-dimension

Viewer • Updated Jun 14, 2024 • 81.1k • 143 • 2

PKU-Alignment/processed-hh-rlhf

Viewer • Updated Nov 24, 2023 • 168k • 44 • 10

PKU-Alignment/PKU-SafeRLHF-30K

Viewer • Updated Nov 20, 2023 • 29.9k • 279 • 9

PKU-Alignment/BeaverTails

Viewer • Updated Oct 17, 2023 • 364k • 4.36k • 61

PKU-Alignment/BeaverTails-single-dimension-preference

Viewer • Updated Aug 18, 2023 • 29.9k • 13

PKU-Alignment/PKU-SafeRLHF-10K

Viewer • Updated Jul 20, 2023 • 10k • 932 • 63

PKU-Alignment/BeaverTails-Evaluation

Viewer • Updated Jul 20, 2023 • 700 • 151 • 13
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs