LlamaForTokenClassification • Collection • Fine-tuned Llama variants for token classification • 6 items • Updated Aug 8, 2024 • 3
AI Governance and Accountability: An Analysis of Anthropic's Claude • Paper • 2407.01557 • Published May 2, 2024 • 2
FRACTURED-SORRY-Bench: Framework for Revealing Attacks in Conversational Turns Undermining Refusal Efficacy and Defenses over SORRY-Bench • Paper • 2408.16163 • Published Aug 28, 2024 • 1