56 tasks

Emerging
1 paper using it
First seen: 2026

The '56 tasks' dataset is a novel simulation benchmark that includes a variety of tasks designed to evaluate multistep reasoning and linguistic variation in robotic manipulation scenarios.

Papers using 56 tasks (1)
