📊 Datasets - Awesome Robotics

304 datasets & benchmarks: 15 canonical foundations plus emerging datasets mined from recent papers. Each links to the papers that use it.

LIBERO (Canonical)

This dataset was created using LeRobot. Dataset structure (meta/info.json): { "codebase_version": "v3.0", "robot_type": "panda", "total_episodes": 1693, "total_frames": 273465, "total_tasks": 40, "chunks_size": 1000, "fps": 10.0, "splits": { "train": "0:1693" }, "data_path": "data/chunk-{chunk_index:03d}/file-{file_index:03d}.parquet", "video_path": "videos/{video_key}/chunk-{chunk_index:03d}/file-{file_index:03d}.mp4"… See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceVLA/libero.

📄 3 papers · ⬇ 23.2k · 💛 60 · 🤗 HF · apache-2.0
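
The path templates and counts in the LIBERO metadata excerpt can be read as ordinary Python format strings. The following is a minimal sketch, not LeRobot's own loader: it copies only the fields visible in the truncated excerpt, the helper names and the `example_camera` video key are hypothetical, and it assumes the split string `"0:1693"` denotes a half-open `start:stop` range of episode indices.

```python
# Fields copied from the truncated meta/info.json excerpt; nothing else is assumed to exist.
info = {
    "total_episodes": 1693,
    "total_frames": 273465,
    "fps": 10.0,
    "splits": {"train": "0:1693"},
    "data_path": "data/chunk-{chunk_index:03d}/file-{file_index:03d}.parquet",
    "video_path": "videos/{video_key}/chunk-{chunk_index:03d}/file-{file_index:03d}.mp4",
}


def resolve_data_path(chunk_index: int, file_index: int) -> str:
    """Expand the parquet path template (hypothetical helper)."""
    return info["data_path"].format(chunk_index=chunk_index, file_index=file_index)


def resolve_video_path(video_key: str, chunk_index: int, file_index: int) -> str:
    """Expand the video path template (hypothetical helper)."""
    return info["video_path"].format(
        video_key=video_key, chunk_index=chunk_index, file_index=file_index
    )


def split_range(name: str) -> range:
    """Parse a split string, assuming it means a half-open start:stop range."""
    start, stop = (int(part) for part in info["splits"][name].split(":"))
    return range(start, stop)


# Average episode length implied by the totals and the frame rate.
mean_frames = info["total_frames"] / info["total_episodes"]
mean_seconds = mean_frames / info["fps"]

print(resolve_data_path(0, 0))
print(resolve_video_path("example_camera", 0, 3))  # "example_camera" is a placeholder key
print(len(split_range("train")), round(mean_frames, 1), round(mean_seconds, 1))
```

Under these assumptions the totals imply episodes of roughly 160 frames each, or about 16 seconds at 10 fps.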
GAIA (Emerging)

GAIA dataset: GAIA is a benchmark that aims to evaluate next-generation LLMs (LLMs with augmented capabilities due to added tooling, efficient prompting, access to search, etc.). We added gating to prevent bots from scraping the dataset. Please do not reshare the validation or test set in a crawlable format. Data and leaderboard: GAIA is made up of more than 450 non-trivial questions with unambiguous answers, requiring different levels of tooling and autonomy to… See the full description on the dataset page: https://huggingface.co/datasets/gaia-benchmark/GAIA.

📄 2 papers · ⬇ 42.2k · 💛 692 · 🤗 HF
C. elegans (Emerging)
📄 2 papers
COCO (Emerging)
📄 2 papers
Robosuite (Emerging)
📄 2 papers
ScanNet++ (Emerging)
📄 2 papers
Science Q&A (Emerging)
📄 2 papers
ManiSkill (Canonical)

ManiSkill Data: ManiSkill is a unified benchmark for learning generalizable robotic manipulation skills, powered by SAPIEN. It features 20 out-of-the-box task families with 2000+ diverse object models and 4M+ demonstration frames. It also supports fast learning from visual input: a CNN-based policy can collect samples at about 2000 FPS with 1 GPU and 16 processes on a workstation. The benchmark can be used to study a wide range of algorithms: 2D & 3D vision-based… See the full description on the dataset page: https://huggingface.co/datasets/haosulab/ManiSkill.

📄 1 paper · ⬇ 259 · 💛 3 · 🤗 HF · apache-2.0
CALVIN (Canonical)

Dataset Card for "clavin". More information needed.

📄 1 paper · ⬇ 13 · 🤗 HF
100-site flux ladder (Emerging)
📄 1 paper
236 cleaned cases (Emerging)
📄 1 paper
26B-A4B Mixture-of-Experts (Emerging)
📄 1 paper
3D industrial atmospheric flow application (Emerging)
📄 1 paper
40 real world transcripts (Emerging)
📄 1 paper
444 LiveCodeBench (Emerging)
📄 1 paper
54 synthetic benchmark cells (Emerging)
📄 1 paper
88 eGeMAPS (Emerging)
📄 1 paper
89 containerized tasks (Emerging)
📄 1 paper
ADE20K (Emerging)
📄 1 paper
ADNI-1 (Emerging)
📄 1 paper
adversarial dataset of 103 clinical MCQs (Emerging)
📄 1 paper
Affordance20Q (Emerging)
📄 1 paper
AgentComm-Bench (Emerging)
📄 1 paper
AgentCyberRange (Emerging)
📄 1 paper
AIME 2025 (Emerging)
📄 1 paper
AirSim (Emerging)
📄 1 paper
ALFWorld (Emerging)
📄 1 paper
Alpamayo R1 (Emerging)
📄 1 paper
AMC (Emerging)
📄 1 paper
Android World (Emerging)
📄 1 paper
AppWorld (Emerging)
📄 1 paper
A-share factor discovery (Emerging)
📄 1 paper
AuctionNet (Emerging)
📄 1 paper
AudioCaps (Emerging)
📄 1 paper
AudioDER (Emerging)
📄 1 paper
AuthorBench (Emerging)
📄 1 paper
BCI-IV-2a (Emerging)
📄 1 paper
BLINK (Emerging)
📄 1 paper
BLINK Multi-view (Emerging)
📄 1 paper
BraTS21 (Emerging)
📄 1 paper
BraTS-PEDs (Emerging)
📄 1 paper
CA (Emerging)
📄 1 paper
Cage (Emerging)
📄 1 paper
CAGE Challenge 4 (Emerging)
📄 1 paper
CARLA (Emerging)
📄 1 paper
ChemLex (Emerging)
📄 1 paper
Chinese Mobile Screen Teach Benchmark (Emerging)
📄 1 paper
chiral XXX chains (Emerging)
📄 1 paper
CHIRPS (Emerging)
📄 1 paper
ChronoID (Emerging)
📄 1 paper
CIFAR-100 (Emerging)
📄 1 paper
Cityscapes (Emerging)
📄 1 paper
Clay v1.5 (Emerging)
📄 1 paper
ClinHallu (Emerging)
📄 1 paper
CLIP (Emerging)
📄 1 paper
COCO 2017 (Emerging)
📄 1 paper
Codeforces (Emerging)
📄 1 paper
Common Vulnerabilities and Exposures (CVE) (Emerging)
📄 1 paper
ContactWorld (Emerging)
📄 1 paper
COSMO-SkyMed (Emerging)
📄 1 paper