NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security Paper • 2406.05590 • Published Jun 8, 2024