33 cybench challenges
Emerging
1 paper using it
First seen: 2026
The '33 cybench challenges' set is a benchmark used to evaluate how well various cybersecurity AI scaffolds solve a diverse set of cybersecurity tasks.