HumanEval-XL
Emerging
8 papers using it
106 HF downloads
12 HF likes
2023 first seen
A collection of cross-lingual benchmarks for code generation.
Hugging Face · apache-2.0
Papers using HumanEval-XL (8)
- Fully Autonomous Programming using Iterative Multi-Agent Debugging with Large Language Models
- SwiftEval: Developing a Language-Specific Benchmark for LLM-generated Code Evaluation
- Programming Language Confusion: When Code LLMs Can't Keep their Languages Straight
- CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X
- Mutation-based Consistency Testing for Evaluating the Code Understanding Capability of LLMs
- HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
- CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model
- InterTrans: Leveraging Transitive Intermediate Translations to Enhance LLM-based Code Translation