Auditing Sabotage Bench
Emerging1papers using it
2026first seen
The Auditing Sabotage Bench is a benchmark consisting of 9 ML research codebases with sabotaged variants used to evaluate the ability of auditors to detect and fix sabotage in machine learning research.