← all datasets

SWE-bench Multilingual

Emerging
9papers using it
2025first seen

The 'SWE-bench Multilingual' is a benchmark dataset used to evaluate the performance of coding agents on software engineering tasks across multiple languages.

Papers using SWE-bench Multilingual (9)

SWE-bench Multilingual β€” datasets β€” ai-for-code