#ModelPass@1Paper
1O1 Mini (Sept 2024)89.00link
2O1 Preview (Sept 2024)89.00link
3GPT 4o (Aug 2024)87.20link
4Qwen2.5-Coder-32B-Instruct87.20link
5DeepSeek-V3 (Nov 2024)86.60link
6GPT-4-Turbo (April 2024)86.60link
7DeepSeek-V2.5 (Nov 2024)83.50link
8GPT 4o Mini (July 2024)83.50link
9DeepSeek-Coder-V2-Instruct82.30link
10Claude Sonnet 3.5 (June 2024)81.70link
11GPT-4-Turbo (Nov 2023)81.70link
12Grok Beta80.50link
13Gemini 1.5 Pro 00279.30link
14GPT-4 (May 2023)79.30link
15CodeQwen1.5-7B-Chat78.70link
16claude-3-opus (Mar 2024)77.40link
17OpenCoder-8B-Instruct77.40link
18Gemini 1.5 Flash 00275.60link
19DeepSeek-Coder-33B-instruct75.00link
20Codestral-22B-v0.173.80link
21OpenCodeInterpreter-DS-33B73.80link
22WizardCoder-33B-V1.173.20link
23Artigenz-Coder-DS-6.7B72.60link
24Llama3-70B-instruct72.00link
25Mixtral-8x22B-Instruct-v0.172.00link
26OpenCodeInterpreter-DS-6.7B72.00link
27speechless-codellama-34B-v2.072.00link
28DeepSeek-Coder-6.7B-instruct71.30link
29DeepSeek-Coder-7B-instruct-v1.571.30link
30Magicoder-S-DS-6.7B71.30link
HumanEval+ humaneval-plus Leaderboard