#ModelAccuracyPaper
1GPT-4o88.89β€”
2GPT-4-Turbo88.50β€”
3Gemini Pro 1.087.50β€”
4Falcon-180B-Chat87.00β€”
5Mixtral-8x7B-Instruct87.00β€”
6GPT-3.5-Turbo80.30β€”
7Mistral-7B-Instruct-v0.274.82β€”
8Gemma-1.1-7B73.32β€”
9Llama-3-8B-Instruct71.25β€”
10Flan-T5-XXL67.50β€”
11Llama 2-70B66.10β€”
12Zephyr-7B-beta65.00β€”
13Qwen1.5-MoE-A2.7B60.73β€”
14Qwen1.5-7B59.79β€”
15Qwen-7B54.09β€”
16Phi-252.13β€”
17DeciLM-7B50.75β€”
18Llama3-ChatQA-1.5-8B49.64β€”
19Qwen1.5-4B40.29β€”
20Genstruct-7B36.93β€”
21Llama-3-8B36.00β€”
22Gemma-7B34.28β€”
23Dolly V2 12b BF1627.00β€”
24Gemma-2B19.18β€”
25Phi-3-mini-4k-Instruct4.80β€”
CyberMetric (10K) cybermetric Leaderboard