MinorBench: A hand-built benchmark for content-based risks for children Paper • 2503.10242 • Published 15 days ago • 4 • 3
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper • 2411.12946 • Published Nov 20, 2024 • 22 • 2