← all datasets

CaP-Bench

Emerging
2papers using it
2026first seen

CaP-Bench is a benchmark that evaluates the performance of frontier language and vision-language models in robot manipulation across varying levels of abstraction, interaction, and perceptual grounding.

Papers using CaP-Bench (2)

CaP-Bench β€” datasets β€” robotics