← all datasets

LoCoBench

Emerging
4papers using it
30HF downloads
1HF likes
2025first seen

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering LoCoBench is a comprehensive benchmark specifically designed to evaluate long-context Large Language Models (LLMs) in complex software development scenarios. It provides 8,000 evaluation scenarios across 10 programming languag

Papers using LoCoBench (4)

LoCoBench β€” datasets β€” ai-for-code