Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking Paper • 2502.12970 • Published Feb 18
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator Paper • 2405.18111 • Published May 28, 2024