HumanEval (Plus)
Emerging
9 papers using it
First seen: 2024
Papers using HumanEval (Plus) (9)
- Demystifying Errors in LLM Reasoning Traces: An Empirical Study of Code Execution Simulation
- Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
- ACECODER: Acing Coder RL via Automated Test-Case Synthesis
- Dynamic Scaling of Unit Tests for Code Reward Modeling
- Uncovering Weaknesses in Neural Code Generation
- Multi-Programming Language Ensemble for Code Generation in Large Language Model
- Dynamic Scaling of Unit Tests for Code Reward Modeling
- ACECODER: Acing Coder RL via Automated Test-Case Synthesis
- ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning