LiveCodeBench-v-6
Emerging
13 papers using it
First seen: 2025
LiveCodeBench-v6 is a benchmark dataset for evaluating models on coding tasks. It assesses their ability to generate and understand code.
Papers using LiveCodeBench-v-6 (13)
- Cast a Wider Net: Coordinated Pass@K Policy Optimization for Code Reasoning
- Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
- Embarrassingly Simple Self-Distillation Improves Code Generation
- Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models
- BACE: LLM-based Code Generation through Bayesian Anchored Co-Evolution of Code and Test Populations
- X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests
- CoreThink: A Symbolic Reasoning Layer to reason over Long Horizon Tasks with LLMs
- PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning
- Rethinking Verification for LLM Code Generation: From Generation to Testing
- PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning
- X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests
- Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models
- Embarrassingly Simple Self-Distillation Improves Code Generation