CodeContests Leaderboard
Auto-discovered from papers reporting CodeContests (pass@1) · Metric: pass@1 (higher is better)
| # | Model (paper title) | pass@1 |
|---|---|---|
| 1 | SolidCoder: Bridging the Mental-Reality Gap in LLM Code Generation through Concrete Execution | 77.00 |
| 2 | CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models | 43.00 |
| 3 | CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging | 29.10 |
| 4 | $V_1$: Unifying Generation and Self-Verification for Parallel Reasoners | 8.70 |
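
The leaderboard does not say how each paper computed its pass@1 figure, so the sketch below is only an illustration of one common convention, not the method used by any listed paper. It assumes the standard unbiased pass@k estimator, which for k = 1 reduces to the fraction of correct samples per problem, and it assumes scores are reported on a 0 to 100 scale. The function names `pass_at_k` and `benchmark_pass_at_1` are hypothetical.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n samples, c of which pass all tests.

    Assumption: uses the unbiased estimator 1 - C(n - c, k) / C(n, k),
    i.e. the probability that at least one of k samples drawn without
    replacement from the n generated samples is correct.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) is 0 when fewer than k incorrect samples exist,
    # so the estimate is then exactly 1.0.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_1(results: list[tuple[int, int]]) -> float:
    """Average pass@1 over problems, as a percentage.

    `results` holds one (n, c) pair per problem (hypothetical input format).
    """
    if not results:
        raise ValueError("results must not be empty")
    per_problem = [pass_at_k(n, c, 1) for n, c in results]
    return 100.0 * sum(per_problem) / len(per_problem)
```

With a single sample per problem (n = 1), this is simply the percentage of problems solved on the first attempt. Papers may differ in sampling budget, test sets, and base models, so scores in the table are not necessarily directly comparable.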