πŸ“Š Datasets β€” Awesome Time Series

335 datasets & benchmarks β€” 15 canonical foundations plus emerging datasets mined from recent papers. Each links to the papers that use it.

335 of 335 datasets
CIFAR-100Emerging
πŸ“„ 4 papers
CIFAR-10Emerging
πŸ“„ 3 papers⬇ 1.7kπŸ€— HF
MATH-500Emerging

Dataset Card for MATH-500 This dataset contains a subset of 500 problems from the MATH benchmark that OpenAI created in their Let's Verify Step by Step paper. See their GitHub repo for the source file: https://github.com/openai/prm800k/tree/main?tab=readme-ov-file#math-splits

πŸ“„ 2 papers⬇ 141.4kπŸ’› 316πŸ€— HF
GAIAEmerging

GAIA dataset GAIA is a benchmark which aims at evaluating next-generation LLMs (LLMs with augmented capabilities due to added tooling, efficient prompting, access to search, etc). We added gating to prevent bots from scraping the dataset. Please do not reshare the validation or test set in a crawlable format. Data and leaderboard GAIA is made of more than 450 non-trivial question with an unambiguous answer, requiring different levels of tooling and autonomy to… See the full description on the dataset page: https://huggingface.co/datasets/gaia-benchmark/GAIA.

πŸ“„ 2 papers⬇ 42.2kπŸ’› 692πŸ€— HF
LIBEROEmerging

This dataset was created using LeRobot. Dataset Structure meta/info.json: { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "fps": 10.0, "splits": { "train": "0:1693" }, "data_path": "data/chunk-{chunk_index:03d}/file-{file_index:03d}.parquet", "video_path": "videos/{video_key}/chunk-{chunk_index:03d}/file-{file_index:03d}.mp4"… See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceVLA/libero.

πŸ“„ 2 papers⬇ 23.2kπŸ’› 60πŸ€— HFapache-2.0
AIME24Emerging

AIME 24 American Invitational Mathematics Examination (AIME) 2024 Citation If you use the AIME24 dataset in your research, please consider citing it as follows: @misc{aime24, title={American Invitational Mathematics Examination (AIME) 2024}, author={Zhang, Yifan and Math-AI, Team}, year={2024}, }

πŸ“„ 2 papers⬇ 6.7kπŸ’› 18πŸ€— HFapache-2.0
LongMemEvalEmerging

⚠️ This dataset is deprecated. It is replaced by longmemeval-cleaned (https://huggingface.co/datasets/xiaowu0162/longmemeval-cleaned) which noisy history sessions that interfere with the answer correctness.

πŸ“„ 2 papers⬇ 2.5kπŸ’› 21πŸ€— HFmit
HumanEvalEmerging

HumanEval-X is a benchmark for the evaluation of the multilingual ability of code generative models. It consists of 820 high-quality human-crafted data samples (each with test cases) in Python, C++, Java, JavaScript, and Go, and can be used for various tasks.

πŸ“„ 2 papers⬇ 1.8kπŸ’› 95πŸ€— HFapache-2.0
HotpotQAEmerging

Dataset Card for BEIR Benchmark hotpotqa is one of the datasets from the Question Answering task within BEIR, measuring Wikipedia article retrieval for a given multi-hop query. Dataset Summary BEIR is a heterogeneous benchmark built from 18 diverse datasets representing 9 information retrieval tasks. Fact-checking: FEVER, Climate-FEVER, SciFact Question-Answering: NQ, HotpotQA, FiQA-2018 Bio-Medical IR: TREC-COVID, BioASQ, NFCorpus News Retrieval: TREC-NEWS, Robust04… See the full description on the dataset page: https://huggingface.co/datasets/BeIR/hotpotqa.

πŸ“„ 2 papers⬇ 1.4kπŸ’› 16πŸ€— HFcc-by-sa-4.0
AfriXNLIEmerging

Dataset Card for afrixnli Dataset Summary AFRIXNLI is an evaluation dataset comprising translations of a subset of the XNLI dataset into 16 African languages. It includes both validation and test sets across all 18 languages, maintaining the English and French subsets from the original XNLI dataset. Languages There are 18 languages available : Dataset Structure Data Instances The examples look like this for English: from datasets import… See the full description on the dataset page: https://huggingface.co/datasets/masakhane/afrixnli.

πŸ“„ 2 papers⬇ 1.2kπŸ’› 5πŸ€— HFapache-2.0
SkillsBenchEmerging

Warning: The leaderboard above is generated by Hugging Face eval-results and may be incomplete until evaluation_framework: benchflow is accepted and deployed. The audited SkillsBench v1.1 result archive is https://huggingface.co/datasets/benchflow/skillsbench-leaderboard, with compact official exports under leaderboard/skillsbench/v1.1/. Warning: The dataset is a read-only mirror. The primary source for this benchmark is on GitHub: https://github.com/benchflow-ai/skillsbench. Open issues and… See the full description on the dataset page: https://huggingface.co/datasets/benchflow/skillsbench.

πŸ“„ 2 papers⬇ 418πŸ’› 5πŸ€— HFapache-2.0
QM9Emerging

Dataset Card for "QM9" More Information needed

πŸ“„ 2 papers⬇ 305πŸ’› 4πŸ€— HF
ALFWorldEmerging
πŸ“„ 2 papers⬇ 19πŸ€— HF
WikiText-2Emerging
πŸ“„ 2 papers
100,000 drug-like moleculesEmerging
πŸ“„ 1 paper
1000 Genomes ProjectEmerging
πŸ“„ 1 paper
10-case custom-geometry benchmarkEmerging
πŸ“„ 1 paper
144-configuration multivariate benchmarkEmerging
πŸ“„ 1 paper
15-prompt parser benchmarkEmerging
πŸ“„ 1 paper
15-wavelength tunable 3x3 MMI benchmarkEmerging
πŸ“„ 1 paper
16 public benchmarksEmerging
πŸ“„ 1 paper
2025 Putnam CompetitionEmerging
πŸ“„ 1 paper
20 benchmarksEmerging
πŸ“„ 1 paper
2D/3D benchmarksEmerging
πŸ“„ 1 paper
2D chest X-ray (CXR)Emerging
πŸ“„ 1 paper
2D dynamic controlEmerging
πŸ“„ 1 paper
2D Kolmogorov flow benchmarkEmerging
πŸ“„ 1 paper
34 SuiteSparse matricesEmerging
πŸ“„ 1 paper
3D chest computed tomography (CT)Emerging
πŸ“„ 1 paper
3T fMRI BOLD5000Emerging
πŸ“„ 1 paper
40-dimensional Lorenz--96 testbedEmerging
πŸ“„ 1 paper
5,051 videosEmerging
πŸ“„ 1 paper
50-task hotel expense benchmarkEmerging
πŸ“„ 1 paper
79 kidfluencer channelsEmerging
πŸ“„ 1 paper
7T fMRI Natural Scenes DatasetEmerging
πŸ“„ 1 paper
AACR Project GENIE Biopharma Collaborative datasetEmerging
πŸ“„ 1 paper
ADE20KEmerging
πŸ“„ 1 paper
Agent500Emerging
πŸ“„ 1 paper
Agents' Last Exam (ALE)Emerging
πŸ“„ 1 paper
AIAA High-Lift Prediction WorkshopEmerging
πŸ“„ 1 paper
AIME 2025Emerging
πŸ“„ 1 paper
AIME 2025-2026Emerging
πŸ“„ 1 paper
AIME25Emerging
πŸ“„ 1 paper
All of Us Research ProgramEmerging
πŸ“„ 1 paper
AllShowersEmerging
πŸ“„ 1 paper
$\alpha$NLIEmerging
πŸ“„ 1 paper
AMC23Emerging
πŸ“„ 1 paper
AMPdsEmerging
πŸ“„ 1 paper
AntMazeEmerging
πŸ“„ 1 paper
ARC-ChallengeEmerging
πŸ“„ 1 paper
ASTEEREmerging
πŸ“„ 1 paper
ASVspoof 5Emerging
πŸ“„ 1 paper
Atari-style video gamesEmerging
πŸ“„ 1 paper
Audio Flamingo 3Emerging
πŸ“„ 1 paper
AudioQAEmerging
πŸ“„ 1 paper
AUIB epigame studyEmerging
πŸ“„ 1 paper
BDD100KEmerging
πŸ“„ 1 paper
BEIREmerging
πŸ“„ 1 paper
BERTEmerging
πŸ“„ 1 paper
BFCL multi-turnEmerging
πŸ“„ 1 paper