πŸ“Š Datasets β€” Awesome Federated Learning

341 datasets & benchmarks β€” 17 canonical foundations plus emerging datasets mined from recent papers. Each links to the papers that use it.

341 of 341 datasets
HotpotQAEmerging

Dataset Card for BEIR Benchmark hotpotqa is one of the datasets from the Question Answering task within BEIR, measuring Wikipedia article retrieval for a given multi-hop query. Dataset Summary BEIR is a heterogeneous benchmark built from 18 diverse datasets representing 9 information retrieval tasks. Fact-checking: FEVER, Climate-FEVER, SciFact Question-Answering: NQ, HotpotQA, FiQA-2018 Bio-Medical IR: TREC-COVID, BioASQ, NFCorpus News Retrieval: TREC-NEWS, Robust04… See the full description on the dataset page: https://huggingface.co/datasets/BeIR/hotpotqa.

πŸ“„ 4 papers⬇ 1.4kπŸ’› 16πŸ€— HFcc-by-sa-4.0
LoCoMoEmerging
πŸ“„ 4 papers
AIME2025Emerging

AIME 2025 Dataset Dataset Description This dataset contains problems from the American Invitational Mathematics Examination (AIME) 2025-I & II.

πŸ“„ 3 papers⬇ 14.4kπŸ’› 55πŸ€— HFmit
ImageNetEmerging
πŸ“„ 3 papers⬇ 32πŸ€— HF
ALFWorldEmerging
πŸ“„ 3 papers
LongMemEvalEmerging
πŸ“„ 3 papers
SWE-Bench VerifiedEmerging
πŸ“„ 3 papers
CIFAR-10Canonical
πŸ“„ 2 papers⬇ 1.7kπŸ€— HF
SkillsBenchEmerging

Warning: The leaderboard above is generated by Hugging Face eval-results and may be incomplete until evaluation_framework: benchflow is accepted and deployed. The audited SkillsBench v1.1 result archive is https://huggingface.co/datasets/benchflow/skillsbench-leaderboard, with compact official exports under leaderboard/skillsbench/v1.1/. Warning: The dataset is a read-only mirror. The primary source for this benchmark is on GitHub: https://github.com/benchflow-ai/skillsbench. Open issues and… See the full description on the dataset page: https://huggingface.co/datasets/benchflow/skillsbench.

πŸ“„ 2 papers⬇ 418πŸ’› 5πŸ€— HFapache-2.0
HumanML3DEmerging
πŸ“„ 2 papers⬇ 402πŸ’› 7πŸ€— HF
VBVR-BenchEmerging

VBVR-Bench Re-hosted copy of Video-Reason/VBVR-Bench-Data, converted to standard HuggingFace parquet format. Splits in_domain: 50 tasks x 5 samples = 250 entries (tasks overlap with the VBVR training set). out_of_domain: 50 tasks x 5 samples = 250 entries (held-out reasoning tasks). Schema field type notes task_name string e.g. G-13_grid_number_sequence_data-generator video_idx string zero-padded sample id (00000..00004) domain string… See the full description on the dataset page: https://huggingface.co/datasets/pufanyi/VBVR-Bench.

πŸ“„ 2 papers⬇ 116πŸ€— HFapache-2.0
TriviaQAEmerging
πŸ“„ 2 papers⬇ 7πŸ€— HF
ACLEmerging
πŸ“„ 2 papers
BusanEmerging
πŸ“„ 2 papers
COCOEmerging
πŸ“„ 2 papers
GenEvalEmerging
πŸ“„ 2 papers
GPQAEmerging
πŸ“„ 2 papers
GSM8KEmerging
πŸ“„ 2 papers
ICLREmerging
πŸ“„ 2 papers
ICMLEmerging
πŸ“„ 2 papers
ImageNet-1KEmerging
πŸ“„ 2 papers
MATH500Emerging
πŸ“„ 2 papers
MemBenchEmerging
πŸ“„ 2 papers
MNISTCanonical
πŸ“„ 2 papers
VQA benchmarksEmerging
πŸ“„ 2 papers
Ο„-BenchEmerging
πŸ“„ 2 papers
160-image human-labeled diagnostic benchmarkEmerging
πŸ“„ 1 paper
178 in-the-wild objectsEmerging
πŸ“„ 1 paper
2048Emerging
πŸ“„ 1 paper
33 datasetsEmerging
πŸ“„ 1 paper
7 real-world scenesEmerging
πŸ“„ 1 paper
A2UI-BenchEmerging
πŸ“„ 1 paper
ABC-BenchEmerging
πŸ“„ 1 paper
ACEEmerging
πŸ“„ 1 paper
Actor-18MEmerging
πŸ“„ 1 paper
Actor-BenchEmerging
πŸ“„ 1 paper
AF-ChatEmerging
πŸ“„ 1 paper
AFHQv2Emerging
πŸ“„ 1 paper
AF-ThinkEmerging
πŸ“„ 1 paper
AgencyBenchEmerging
πŸ“„ 1 paper
AgentIF-OneDayEmerging
πŸ“„ 1 paper
AgentSearchBenchEmerging
πŸ“„ 1 paper
Agents Research EnvironmentsEmerging
πŸ“„ 1 paper
AIBenchEmerging
πŸ“„ 1 paper
alice29.txtEmerging
πŸ“„ 1 paper
Amazon FashionEmerging
πŸ“„ 1 paper
Animal-FacesEmerging
πŸ“„ 1 paper
Ann ArborEmerging
πŸ“„ 1 paper
ArabidopsisEmerging
πŸ“„ 1 paper
ARC-AGI-1Emerging
πŸ“„ 1 paper
ARC-CEmerging
πŸ“„ 1 paper
arXivEmerging
πŸ“„ 1 paper
AssetOpsBench (AOB)Emerging
πŸ“„ 1 paper
AstroReason-BenchEmerging
πŸ“„ 1 paper
AudioSkills-XLEmerging
πŸ“„ 1 paper
BABEEmerging
πŸ“„ 1 paper
BAGELEmerging
πŸ“„ 1 paper
BFCLv3Emerging
πŸ“„ 1 paper
BIOSCAN-5MEmerging
πŸ“„ 1 paper
Bioscan-TraitsEmerging
πŸ“„ 1 paper