← all datasets

SWE-bench-Live

Emerging
3papers using it
7,661HF downloads
7HF likes
2025first seen

A brand-new, continuously updated SWE-bench-like dataset powered by an automated curation pipeline. For the official data release page, please see microsoft/SWE-bench-Live. Dataset Summary SWE-bench-Live is a live benchmark for issue resolving, designed to evaluate an AI system’s ability to complete real-world software

Papers using SWE-bench-Live (3)