Evalplus
Emerging8papers using it
2024first seen
Papers using Evalplus (8)
- LLM-Powered Test Case Generation for Detecting Bugs in Plausible ProgramsUnify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of
Unit Tests with LLMsNOIR: Privacy-Preserving Generation of Code with Open-Source LLMsOpenCodeInterpreter: Integrating Code Generation with Execution and
RefinementLow-Cost Language Models: Survey and Performance Evaluation on Python
Code GenerationTowards Large Language Model Aided Program RefinementBeyond Code Generation: Assessing Code LLM Maturity with PostconditionsOpenCodeInterpreter: Integrating Code Generation with Execution and
Refinement