← all datasets

SWE-bench Lite

Emerging

54papers using it

2024first seen

'SWE-bench Lite' is a dataset/benchmark used to evaluate the performance of software engineering tools and techniques, specifically in the context of assessing the effectiveness of augmentation hooks in improving baseline performance metrics.

🔎 Find this dataset

Papers using SWE-bench Lite (54)

Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration2024 · 7 cites

SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution2025 · 4 cites

Meta-RAG on Large Codebases Using Code Summarization2025 · 1 cites

UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench2025 · 1 cites

SWE-bench Goes Live!2025 · 1 cites

Function-Aware Fill-in-the-Middle as Mid-Training for Coding Agent Foundation Models2026

Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills2026

Triage: Routing Software Engineering Tasks to Cost-Effective LLM Tiers via Code Quality Signals2026

AgentForge: Execution-Grounded Multi-Agent LLM Framework for Autonomous Software Engineering2026

Spec Kit Agents: Context-Grounded Agentic Workflows2026

RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair2026

SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution2026

CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents2026

Monte Carlo Tree Search for Execution-Guided Program Repair with Large Language Models2026

What's in a Benchmark? The Case of SWE-Bench in Automated Program Repair2026

Pull Requests as a Training Signal for Repo-Level Code Editing2026

SWE Context Bench: A Benchmark for Context Learning in Coding2026

Debug2Fix: Can Interactive Debugging Help Coding Agents Fix More Bugs?2026

RGFL: Reasoning Guided Fault Localization for Automated Program Repair Using Large Language Models2026

From Historical Patches to Repair Plans: Outcome-Conditioned Reasoning for Repository-Level Program Repair2026

BOAD: Discovering Hierarchical Software Engineering Agents via Bandit Optimization2025

DPO-F+: Aligning Code Repair Feedback with Developers' Preferences2025

InfCode: Adversarial Iterative Refinement of Tests and Patches for Reliable Software Issue Resolution2025

LLM Assisted Coding with Metamorphic Specification Mutation Agent2025

Improving Code Localization with Repository Memory2025

REFINE: Enhancing Program Repair Agents through Context-Aware Patch Refinement2025

SIADAFIX: issue description response for adaptive program repair2025

Huxley-G\"odel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine2025

CoreThink: A Symbolic Reasoning Layer to reason over Long Horizon Tasks with LLMs2025

Towards Explorative IRBL: Combining Semantic Retrieval with LLM-driven Iterative Code Exploration2025

Kodezi Chronos: A Debugging-First Language Model for Repository-Scale Code Understanding2025

Repeton: Structured Bug Repair with ReAct-Guided Patch-and-Test Cycles2025

MCTS-Refined CoT: High-Quality Fine-Tuning Data for LLM-Based Repository Issue Resolution2025

SemAgent: A Semantics Aware Program Repair Agent2025

PAGENT: Learning to Patch Software Engineering Agents2025

Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks2025

Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks2025

SweRank: Software Issue Localization with Code Ranking2025

SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs2025

SEAlign: Alignment Training for Software Engineering Agent2025

Enhancing repository-level software repair via repository-aware knowledge graphs2025

DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal2025

OrcaLoca: An LLM Agent Framework for Software Issue Localization2025

Bridging Bug Localization and Issue Fixing: A Hierarchical Localization Framework Leveraging Large Language Models2025

SoRFT: Issue Resolving with Subtask-oriented Reinforced Fine-Tuning2025

AutoCodeRover: Autonomous Program Improvement2024 · 101 cites

Agentless: Demystifying LLM-based Software Engineering Agents2024 · 16 cites

SWE-Bench+: Enhanced Coding Benchmark for LLMs2024 · 5 cites

Training Software Engineering Agents and Verifiers with SWE-Gym2024 · 1 cites

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents2024 · 1 cites

HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale2024 · 1 cites

MASAI: Modular Architecture for Software-engineering AI Agents2024

SpecRover: Code Intent Extraction via LLMs2024

SuperCoder2.0: Technical Report on Exploring the feasibility of LLMs as Autonomous Programmer2024

SWE-bench Lite dataset — papers, benchmarks & downloads · AI for Code