SWE-bench Pro
Emerging
3 papers using it
First seen: 2026
SWE-bench Pro is a benchmark dataset for evaluating coding agents on software engineering tasks, with a particular focus on how efficiently they explore code repositories.