#ModelScorePaper
1Agent_v0.1.40.83β€”
2Skywork Deep Research Agent v20.83β€”
3Agent_v0.1.30.82β€”
4πŸ¦β€πŸ”₯ AWorld (Run Instantly)0.82β€”
5Agent_v0.1.20.81β€”
6Agent_v0.1.10.80β€”
7h2oGPTe Agent v1.6.330.80β€”
8Su Zero Ultra0.80β€”
9Agent2030-v2.30.79β€”
10Agent_v0.1.00.79β€”
11h2oGPTe Agent v1.6.320.79β€”
12desearch0.78β€”
13🦀 AWorld (Run Instantly)0.77β€”
14Agent2030-v2.20.76β€”
15SU AI Zero0.76β€”
16Agent_v0.0.90.75β€”
17Alita0.75β€”
18h2oGPTe Agent v1.6.27 | March 17 original date0.75β€”
19Agent2030-v2.10.74β€”
20Agent_v0.0.80.73β€”
21AgentZ_v0.100.73β€”
22Langfun Agent v2.30.73β€”
23Agent2030-v2.00.72β€”
24agent 900000.72β€”
25agent-pro0.72β€”
26agent zero v1.20.72β€”
27🦩 AWorld (Run Instantly)0.72β€”
28Langfun Agent v2.20.72β€”
29agent3330.71β€”
30agent zero v1.10.71β€”
GAIA Benchmark (2023) gaia Leaderboard