LLM-Tuning-Safety

university
https://llm-tuning-safety.github.io/

AI & ML interests

None defined yet.

Recent Activity

breakend authored a paper about 1 month ago
On the Opportunities and Risks of Foundation Models
breakend authored a paper about 1 month ago
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
breakend authored a paper about 1 month ago
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

Peter Henderson, Yi Zeng, Tinghao Xie, Xiangyu Qi

models 0

None public yet

datasets 1

LLM-Tuning-Safety/HEx-PHI

Preview • Updated Aug 19, 2024 • 248 • 49