ICPC
Emerging6papers using it
2025first seen
The 'ICPC' dataset/benchmark contains problems from premier programming competitions and is used to evaluate the reasoning and coding capabilities of Large Language Models (LLMs).
Papers using ICPC (6)
- Can Multi-turn Self-refined Single Agent LMs with Retrieval Solve Hard Coding Problems?AetherCode: Evaluating LLMs' Ability to Win In Premier Programming CompetitionsOJBench: A Competition Level Code Benchmark For Large Language ModelsEvaluating and Improving Large Language Models for Competitive Program GenerationLiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive
Programming?AetherCode: Evaluating LLMs' Ability to Win In Premier Programming
Competitions