MATH (Level 5) math Leaderboard
Competition-level math problems, hardest level (5). 4-shot accuracy. Β· Metric: Accuracy (higher is better)
| # | Model | Accuracy | Paper |
|---|---|---|---|
| 1 | CombinHorizon/huihui-ai-abliterated-Qwen2.5-32B-Inst-BaseMerge-TIES | 59.44 | β |
| 2 | CombinHorizon/zetasepic-abliteratedV2-Qwen2.5-32B-Inst-BaseMerge-TIES | 58.53 | β |
| 3 | BenevolenceMessiah/Qwen2.5-72B-2x-Instruct-TIES-v1.0 | 57.85 | β |
| 4 | Cran-May/tempmotacilla-cinerea-0308 | 55.51 | β |
| 5 | DavidAU/DeepSeek-R1-Distill-Qwen-25.5B-Brainstorm | 55.36 | β |
| 6 | CombinHorizon/huihui-ai-abliteratedV2-Qwen2.5-14B-Inst-BaseMerge-TIES | 54.76 | β |
| 7 | 1024m/QWEN-14B-B100 | 54.38 | β |
| 8 | CombinHorizon/Josiefied-abliteratedV4-Qwen2.5-14B-Inst-BaseMerge-TIES | 53.17 | β |
| 9 | CultriX/Qwen2.5-14B-HyperMarck-dl | 52.87 | β |
| 10 | Azure99/Blossom-V6-14B | 52.57 | β |
| 11 | Daemontatox/CogitoZ | 52.42 | β |
| 12 | CultriX/Qwen2.5-14B-ReasoningMerge | 52.04 | β |
| 13 | Daemontatox/PathFinderAI2.0 | 50.76 | β |
| 14 | Daemontatox/PathFinderAi3.0 | 50.45 | β |
| 15 | CombinHorizon/Rombos-Qwen2.5-7B-Inst-BaseMerge-TIES | 49.32 | β |
| 16 | CultriX/SeQwence-14Bv3 | 47.66 | β |
| 17 | CultriX/SeQwence-14Bv2 | 47.58 | β |
| 18 | Daemontatox/PathfinderAI | 47.58 | β |
| 19 | Daemontatox/mini_Pathfinder | 47.51 | β |
| 20 | Azure99/Blossom-V6-7B | 45.85 | β |
| 21 | EpistemeAI/DeepThinkers-Phi4 | 45.85 | β |
| 22 | CultriX/Qwen2.5-14B-partialmergept1 | 45.39 | β |
| 23 | CultriX/Qwen2.5-14B-Wernicke-SLERP | 44.86 | β |
| 24 | Cran-May/T.E-8.1 | 44.56 | β |
| 25 | Baptiste-HUVELLE-10/LeTriomphant2.2_ECE_iLAB | 44.49 | β |
| 26 | Cran-May/merge_model_20250308_2 | 43.81 | β |
| 27 | CultriX/Qwen2.5-14B-Emergedv3 | 43.58 | β |
| 28 | CultriX/Qwen2.5-14B-Unity | 43.13 | β |
| 29 | EVA-UNIT-01/EVA-Qwen2.5-72B-v0.2 | 43.13 | β |