MBPP mbpp-2 Leaderboard
Auto-discovered from papers reporting MBPP (Improvement). · Metric: Improvement (higher is better)
| # | Model | Improvement |
|---|---|---|
| 1 | Self-Correcting Code Generation Using Small Language Models | 35.80 |
| 2 | RGD: Multi-LLM Based Agent Debugger via Refinement and Generation Guidance | 16.20 |
| 3 | AdapTrack: Constrained Decoding without Distorting LLM's Output Intent | 6.42 |