A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper • 2411.12946 • Published Nov 20, 2024 • 23
protectai/distilroberta-base-rejection-v1 Text Classification • 0.1B • Updated Mar 11, 2024 • 10.5k • 8