LiveCodeBench livecodebench Leaderboard
Auto-discovered from papers reporting LiveCodeBench (pass@1). Β· Metric: pass@1 (higher is better)
| # | Model | pass@1 | Paper |
|---|---|---|---|
| 1 | BenchEvolver: Frontier Task Synthesis via Solution-Centric Evolution | 99.00 | β |
| 2 | CWM: An Open-Weights LLM for Research on Code Generation with World Models | 68.60 | β |
| 3 | CWM: An Open-Weights LLM for Research on Code Generation with World Models | 68.60 | β |
| 4 | Planning In Natural Language Improves LLM Search For Code Generation | 41.40 | β |
| 5 | $V_1$: Unifying Generation and Self-Verification for Parallel Reasoners | 10.00 | β |