| # | Model | pass@1 | Paper |
|---|-------|--------|-------|
| 1 | BenchEvolver: Frontier Task Synthesis via Solution-Centric Evolution | 99.00 | - |
| 2 | CWM: An Open-Weights LLM for Research on Code Generation with World Models | 68.60 | - |
| 3 | CWM: An Open-Weights LLM for Research on Code Generation with World Models | 68.60 | - |
| 4 | Planning In Natural Language Improves LLM Search For Code Generation | 41.40 | - |
| 5 | $V_1$: Unifying Generation and Self-Verification for Parallel Reasoners | 10.00 | - |
LiveCodeBench Leaderboard