Lakera/gandalf_ignore_instructions
Viewer
•
Updated
•
1k
•
350
•
27
Prompt attack datasets gathered from Gandalf (https://gandalf.lakera.ai/). Including the datasets from 'Gandalf the Red' (https://arxiv.org/abs/250).