📊 Datasets: Awesome Graph Learning

334 datasets & benchmarks: 22 canonical foundations plus emerging datasets mined from recent papers. Each links to the papers that use it.

CIFAR-100 (Emerging)
📄 5 papers
CIFAR-10 (Emerging)
📄 3 papers · ⬇ 1.7k · 🤗 HF
ASVspoof 5 (Emerging)
📄 3 papers
C. elegans (Emerging)
📄 3 papers
GSM8K (Emerging)

GSM8K (Grade School Math 8K) is a dataset of 8.5K high-quality, linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning. These problems take between 2 and 8 steps to solve. Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − × ÷) to reach the… See the full description on the dataset page: https://huggingface.co/datasets/openai/gsm8k.

📄 2 papers · ⬇ 895.3k · 💛 1.4k · 🤗 HF · mit
WMDP (Emerging)

The Weapons of Mass Destruction Proxy (WMDP) benchmark is a dataset of multiple-choice questions that serve as a proxy measurement of hazardous knowledge in biosecurity, cybersecurity, and chemical security. WMDP serves two roles: first, as an evaluation for hazardous knowledge in LLMs, and second, as a benchmark for unlearning methods to remove such hazardous knowledge. We implemented the WMDP evaluation in… See the full description on the dataset page: https://huggingface.co/datasets/cais/wmdp.

📄 2 papers · ⬇ 25.6k · 💛 27 · 🤗 HF · mit
LIBERO (Emerging)

This dataset was created using LeRobot. Dataset structure, from meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "fps": 10.0, "splits": { "train": "0:1693" }, "data_path": "data/chunk-{chunk_index:03d}/file-{file_index:03d}.parquet", "video_path": "videos/{video_key}/chunk-{chunk_index:03d}/file-{file_index:03d}.mp4"… See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceVLA/libero.
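
The `data_path` and `video_path` values in the excerpt above use Python format-string syntax, so a minimal sketch of how they might expand into concrete file names is given here. The helper names and the `front_camera` video key are hypothetical, chosen only for illustration; the real video keys are not shown in the truncated excerpt, and this is not LeRobot's own loading code.

```python
# Templates and counts copied from the meta/info.json excerpt.
DATA_PATH = "data/chunk-{chunk_index:03d}/file-{file_index:03d}.parquet"
VIDEO_PATH = "videos/{video_key}/chunk-{chunk_index:03d}/file-{file_index:03d}.mp4"
TOTAL_EPISODES = 1693
TOTAL_FRAMES = 273465
FPS = 10.0


def resolve_data_path(chunk_index: int, file_index: int) -> str:
    """Fill in the parquet path template (hypothetical helper)."""
    return DATA_PATH.format(chunk_index=chunk_index, file_index=file_index)


def resolve_video_path(video_key: str, chunk_index: int, file_index: int) -> str:
    """Fill in the video path template (hypothetical helper)."""
    return VIDEO_PATH.format(
        video_key=video_key, chunk_index=chunk_index, file_index=file_index
    )


# Rough per-episode statistics derived only from the published totals.
frames_per_episode = TOTAL_FRAMES / TOTAL_EPISODES
seconds_per_episode = frames_per_episode / FPS

print(resolve_data_path(0, 3))
# "front_camera" is an assumed key, not taken from the dataset card.
print(resolve_video_path("front_camera", 1, 12))
print(f"{frames_per_episode:.1f} frames, {seconds_per_episode:.1f} s per episode on average")
```

The `:03d` specifier zero-pads each index to three digits, which is why chunk 0 and file 3 resolve to `chunk-000/file-003`.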

📄 2 papers · ⬇ 23.2k · 💛 60 · 🤗 HF · apache-2.0
ScienceQA (Emerging)

Dataset summary: Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering. Supported tasks and leaderboards: multi-modal multiple choice. Languages: English. Example data instance: {'image': Image, 'question': 'Which of these states is farthest north?', 'choices': ['West Virginia', 'Louisiana', 'Arizona', 'Oklahoma'], 'answer': 0… See the full description on the dataset page: https://huggingface.co/datasets/derek-thomas/ScienceQA.
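
In the sample instance above, `answer` appears to be a zero-based index into `choices`. A small sketch under that assumption shows how the index maps back to the answer text. The `answer_text` helper is hypothetical, and the record keeps only the fields visible in the truncated excerpt (the `image` field is omitted).

```python
# Fields copied from the sample instance; the excerpt is truncated,
# so any further fields of the real record are not represented here.
record = {
    "question": "Which of these states is farthest north?",
    "choices": ["West Virginia", "Louisiana", "Arizona", "Oklahoma"],
    "answer": 0,
}


def answer_text(rec: dict) -> str:
    """Return the choice string selected by the integer answer index.

    Assumes `answer` is a zero-based index into `choices`.
    """
    index = rec["answer"]
    choices = rec["choices"]
    if not 0 <= index < len(choices):
        raise IndexError(f"answer index {index} is outside the choices list")
    return choices[index]


print(answer_text(record))  # West Virginia
```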

📄 2 papers · ⬇ 21.3k · 💛 234 · 🤗 HF · cc-by-sa-4.0
CICIDS2017 (Emerging)

We have developed a Python package as a wrapper around the Hugging Face Hub and the Hugging Face Datasets library to make this dataset easy to access. The nids-datasets package provides functionality to download and use specially curated and extracted datasets from the original UNSW-NB15 and CIC-IDS2017 datasets. These datasets, which were originally flow-only, have been enhanced to include packet-level information from the raw PCAP files. The dataset contains both… See the full description on the dataset page: https://huggingface.co/datasets/rdpahalavan/CIC-IDS2017.

📄 2 papers · ⬇ 1.7k · 💛 4 · 🤗 HF · apache-2.0
HotpotQA (Emerging)

hotpotqa is one of the datasets from the Question Answering task within BEIR, measuring Wikipedia article retrieval for a given multi-hop query. BEIR is a heterogeneous benchmark built from 18 diverse datasets representing 9 information retrieval tasks. Fact-checking: FEVER, Climate-FEVER, SciFact; Question-Answering: NQ, HotpotQA, FiQA-2018; Bio-Medical IR: TREC-COVID, BioASQ, NFCorpus; News Retrieval: TREC-NEWS, Robust04… See the full description on the dataset page: https://huggingface.co/datasets/BeIR/hotpotqa.

📄 2 papers · ⬇ 1.4k · 💛 16 · 🤗 HF · cc-by-sa-4.0
MATH500 (Emerging)

https://github.com/openai/prm800k/blob/main/prm800k/math_splits/test.jsonl

📄 2 papers · ⬇ 188 · 💛 9 · 🤗 HF
LongMemEval_S (Emerging)
📄 2 papers · ⬇ 5 · 🤗 HF
Bash Arena (Emerging)
📄 2 papers
GPT-2 (Emerging)
📄 2 papers
Qwen (Emerging)
📄 2 papers
100,000 drug-like molecules (Emerging)
📄 1 paper
10-case custom-geometry benchmark (Emerging)
📄 1 paper
15-prompt parser benchmark (Emerging)
📄 1 paper
2,664 single-turn tasks (Emerging)
📄 1 paper
2D/3D benchmarks (Emerging)
📄 1 paper
2D airfoil (Emerging)
📄 1 paper
363-task multi-turn corpus (Emerging)
📄 1 paper
3D car (Emerging)
📄 1 paper
47 reasoning-trap questions (Emerging)
📄 1 paper
50-task hotel expense benchmark (Emerging)
📄 1 paper
630 adversarial agent traces (Emerging)
📄 1 paper
7,200 image dataset (Emerging)
📄 1 paper
8,100 force-closure grasps (Emerging)
📄 1 paper
81 objects (Emerging)
📄 1 paper
AACR Project GENIE BPC NSCLC (Emerging)
📄 1 paper
AASIST (Emerging)
📄 1 paper
ABC-Bench (Emerging)
📄 1 paper
ADNI (Emerging)
📄 1 paper
Agent500 (Emerging)
📄 1 paper
AIME 2024 (Emerging)
📄 1 paper
AIS (Emerging)
📄 1 paper
ALFWorld (Emerging)
📄 1 paper
AlpacaEval 2.0 (Emerging)
📄 1 paper
Android World (Emerging)
📄 1 paper
AntPlan-270 (Emerging)
📄 1 paper
A-OKVQA (Emerging)
📄 1 paper
Arena-Hard-v0.1 (Emerging)
📄 1 paper
atrial fibrillation (AF) (Emerging)
📄 1 paper
atrial flutter (AFLT) (Emerging)
📄 1 paper
BCP (Emerging)
📄 1 paper
Befunge-98 (Emerging)
📄 1 paper
BEIR (Emerging)
📄 1 paper
BETA (Emerging)
📄 1 paper
BigCodeBench (Emerging)
📄 1 paper
BioDivergence-Silver-v1.0 (Emerging)
📄 1 paper
Biogen (Emerging)
📄 1 paper
BoolQ (Emerging)
📄 1 paper
Brainfuck (Emerging)
📄 1 paper
BrowseComp-Plus (Emerging)
📄 1 paper
Bundesliga (Emerging)
📄 1 paper
CA (Emerging)
📄 1 paper
Car (Emerging)
📄 1 paper
ChEMBL-MT (Emerging)
📄 1 paper
ChemLex (Emerging)
📄 1 paper
Chignolin (Emerging)
📄 1 paper