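| # | Model | Accuracy (%) | Paper |
|---|-------|--------------|-------|
| 1 | SenseNova-V6-Pro | 67.10 | link |
| 2 | SenseNova-V6-5-Pro | 66.70 | link |
| 3 | GPT-5-20250807 | 65.20 | link |
| 4 | JT-VL-Chat-V3.0 | 64.40 | link |
| 5 | Gemini-2.5-Pro | 64.10 | link |
| 6 | MiMo-VL-7B | 63.80 | link |
| 7 | CongRong-v2.0 | 63.20 | link |
| 8 | BlueLM-2.6-3B | 63.10 | link |
| 9 | GPT-5-mini-20250807 | 62.50 | link |
| 10 | GPT-5-nano-20250807 | 60.90 | link |
| 11 | TeleMM | 60.60 | link |
| 12 | BailingMM-Lite-1203 | 60.10 | link |
| 13 | BlueLM-2.5-3B | 60.00 | link |
| 14 | GPT-4.5 | 60.00 | link |
| 15 | R-4B | 60.00 | link |
| 16 | Kimi-VL-A3B-Thinking-2506 | 59.80 | link |
| 17 | InternVL2.5-38B-MPO | 59.70 | link |
| 18 | BailingMM-Pro-0120 | 59.40 | link |
| 19 | Qwen-VL-Max-0809 | 59.20 | link |
| 20 | InternVL3-78B | 59.10 | link |
| 21 | Ovis2-34B | 58.80 | link |
| 22 | Qwen2-VL-72B | 58.70 | link |
| 23 | GLM-4v-Plus-20250111 | 58.50 | link |
| 24 | GPT-4.1-20250414 | 58.50 | link |
| 25 | InternVL3-38B | 58.40 | link |
| 26 | Qwen2.5-VL-32B | 58.40 | link |
| 27 | InternVL2.5-78B-MPO | 58.10 | link |
| 28 | Gemini-2.0-Flash | 58.00 | link |
| 29 | InternVL2.5-38B | 57.90 | link |
| 30 | HunYuan-Standard-Vision | 57.70 | link |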
HallusionBench Leaderboard