← all datasets

DS-1000

Canonical

15papers using it

2022first seen

1,000 realistic data-science coding problems drawn from StackOverflow across popular Python libraries.

🔎 Find this dataset

Papers using DS-1000 (15)

Knowledge-Enhanced Program Repair for Data Science Code2025 · 2 cites

Deep-Bench: Deep Learning Benchmark Dataset for Code Generation2025 · 1 cites

DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation2026

WizardCoder: Empowering Code Large Language Models with Evol-Instruct2023 · 84 cites

DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation2022 · 33 cites

SelfEvolve: A Code Evolution Framework via Large Language Models2023 · 18 cites

Grounding Data Science Code Generation with Input-Output Specifications2024 · 2 cites

XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts2024 · 2 cites

NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts2024 · 2 cites

Uncovering Weaknesses in Neural Code Generation2024 · 2 cites

An Empirical Study on Self-correcting Large Language Models for Data Science Code Generation2024 · 1 cites

Training Language Models on Synthetic Edit Sequences Improves Code Synthesis2024 · 1 cites

CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing2024

InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models2024

InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct2024

DS-1000 dataset — papers, benchmarks & downloads · AI for Code