AI Safety - a MLap Collection

AI Safety

updated 12 days ago

Safety, Security and Privacy in Machine Learning (data poisoning, jailbreaks, and adversarial attacks)

Universal and Transferable Adversarial Attacks on Aligned Language Models

Paper • 2307.15043 • Published Jul 27, 2023