PL-Guard: Benchmarking Language Model Safety for Polish
Paper: arXiv 2506.16322
Models and datasets from "PL-Guard: Benchmarking Language Model Safety for Polish" by Krasnodębska, A., Seweryn, K., Łukasik, S., & Kusa, W. (2025).
Note: Fine-tuned guard model for the Polish language, version 1.0.
Note: PL-Guard dataset, consisting of two splits: test and test_adversarial.