#ModelPass@1Paper
1GPT-4o-2024-05-1351.10β€”
2DeepSeek-V350.00β€”
3Llama-4-Maverick49.70β€”
4Quasar-Alpha49.60β€”
5Gemini-Exp-111449.20β€”
6Qwen2.5-Coder-32B-Instruct49.00β€”
7DeepSeek-V2-Chat (2024-06-28)48.90β€”
8GPT-4.1-Mini-2025-04-1448.90β€”
9DeepSeek-V2.5-121048.60β€”
10DeepSeek-Coder-V2-Instruct48.20β€”
11GPT-4-Turbo-2024-04-0948.20β€”
12Qwen2.5-Coder-14B-Instruct48.20β€”
13GPT-4o-2024-11-2048.00β€”
14Athene-V2-Chat47.20β€”
15Gemini-Exp-120647.00β€”
16Llama-3.3-70B-Instruct46.90β€”
17Claude-3.5-Sonnet-2024062046.80β€”
18Athene-V2-Agent46.20β€”
19Claude-3.5-Haiku-2024102246.10β€”
20GPT-4o-mini-2024-07-1846.10β€”
21Llama-3.1-70B-Instruct46.10β€”
22GPT-4-061346.00β€”
23Gemini-2.0-Flash-Exp45.90β€”
24Qwen2.5-72B-Instruct45.80β€”
25Hermes-2-Theta-Llama-3-70B45.60β€”
26Claude-3-Opus-2024022945.50β€”
27Phi-445.50β€”
28Gemini-Exp-112145.40β€”
29Mistral-Small-24B-Instruct-250145.30β€”
30Sky-T1-32B-Flash45.10β€”
BigCodeBench (Instruct) bigcodebench-instruct Leaderboard