SWE-bench Multilingual
Status: Emerging
9 papers using it
First seen: 2025
SWE-bench Multilingual is a benchmark dataset for evaluating coding agents on software engineering tasks across multiple programming languages.
Papers using SWE-bench Multilingual (9)
- Laguna M.1/XS.2 Technical Report
- From SWE-ZERO to SWE-HERO: Execution-free to Execution-based Fine-tuning for Software Engineering Agents
- Composer 2 Technical Report
- SWE Context Bench: A Benchmark for Context Learning in Coding
- SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories
- SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories
- Composer 2 Technical Report
- Claw-SWE-Bench: A Benchmark for Evaluating OpenClaw-style Agent Harnesses on Coding Tasks
- FastContext: Training Efficient Repository Explorer for Coding Agents