LiveCodeBench Leaderboard (livecodebench-2)
Auto-discovered from papers reporting LiveCodeBench (Accuracy) · Metric: Accuracy (higher is better)
| # | Model (source paper) | Accuracy |
|---|---|---|
| 1 | Ensemble-Based Uncertainty Estimation for Code Correctness Estimation | 53.40 |
| 2 | Enhancing LLM Code Generation with Ensembles: A Similarity-Based Selection Approach | 50.20 |
| 3 | FLARE: Fine-Grained Diagnostic Feedback for LLM Code Refinement | 7.42 |