← all datasets

SWE-bench Pro

Emerging
3papers using it
2026first seen

'SWE-bench Pro' is a benchmark dataset used to evaluate the performance of coding agents in software engineering tasks, specifically focusing on their ability to explore code repositories efficiently.

Papers using SWE-bench Pro (3)

SWE-bench Pro β€” datasets β€” llm-papers