Datasets - Awesome Speech Audio
366 datasets & benchmarks: 20 canonical foundations plus emerging datasets mined from recent papers. Each links to the papers that use it.
CIFAR-10 [Emerging]
CIFAR-100 [Emerging]
ALFWorld [Emerging]
AudioCaps [Emerging]
C. elegans [Emerging]
GAIA [Emerging]
LibriSpeech [Canonical]
ScienceQA [Emerging]
SlideVQA [Emerging]
SWE-bench Lite [Emerging]
SWE-bench Verified [Emerging]
Tiny-ImageNet [Emerging]
WebShop [Emerging]
100-site flux ladder [Emerging]
115 developing countries [Emerging]
11 benchmarks [Emerging]
26B-A4B Mixture-of-Experts [Emerging]
2WikiMultiHopQA [Emerging]
40 real world transcripts [Emerging]
8,100 force-closure grasps [Emerging]
81 objects [Emerging]
89 containerized tasks [Emerging]
AARRI-Bench [Emerging]
ACDC MRI dataset [Emerging]
ADNI-1 [Emerging]
AdvBench [Emerging]
adversarial dataset of 103 clinical MCQs [Emerging]
Affordance20Q [Emerging]
AgNews [Emerging]
AIME 2025 [Emerging]
AIT Alert Data Set [Emerging]
alkali-activated slag (AAS) dataset [Emerging]
Alpamayo R1 [Emerging]
AMC [Emerging]
Android World [Emerging]
AppWorld [Emerging]
ASG-Bench [Emerging]
Atari [Emerging]
Atari-style protocols [Emerging]
AuctionNet [Emerging]
Australian Tourism [Emerging]
Bach chorales [Emerging]
BCI-IV-2a [Emerging]
BCS_v1 [Emerging]
Beijing PM2.5 [Emerging]
BEIR [Emerging]
BFCL Multi-Turn [Emerging]
BioAgentBench [Emerging]
Blue Gene/L (BGL) [Emerging]
BraTS-PEDs [Emerging]
BRIGHT [Emerging]
BrowseComp [Emerging]
CA [Emerging]
Cage [Emerging]
CAGE Challenge 4 [Emerging]
Cancer Dependency Map [Emerging]
CANDOR [Emerging]
CapTraceBench [Emerging]
Caselaw Access Project [Emerging]
Chatterbox [Emerging]