MultiPL-E
Canonical14papers using it
45,997HF downloads
68HF likes
2023first seen
Dataset Card for MultiPL-E Dataset Summary MultiPL-E is a dataset for evaluating large language models for code generation that supports 22 programming languages. It takes the OpenAI HumanEval and the Mostly Basic Python Programs (MBPP) benchmarks and uses little compilers to translate them to other languages. It is ea
π€ Hugging Faceβ mit
Papers using MultiPL-E (14)
- SwiftEval: Developing a Language-Specific Benchmark for LLM-generated Code EvaluationAgnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning EnvironmentIterative Self-Training for Code Generation via Reinforced Re-RankingReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code GenerationCode Llama: Open Foundation Models for CodeSantaCoder: don't reach for the stars!A Preliminary Study of Multilingual Code Language Models for Code
Generation Task Using Translated BenchmarksInstruction Fusion: Advancing Prompt Evolution through HybridizationXFT: Unlocking the Power of Code Instruction Tuning by Simply Merging
Upcycled Mixture-of-ExpertsInverseCoder: Self-improving Instruction-Tuned Code LLMs with
Inverse-Instruct$\mathbb{USCD}$: Improving Code Generation of LLMs by Uncertainty-Aware
Selective Contrastive DecodingExecRepoBench: Multi-level Executable Code Completion EvaluationPERC: Plan-As-Query Example Retrieval for Underrepresented Code
GenerationIterative Self-Training for Code Generation via Reinforced Re-Ranking