CaP-Bench
Emerging
2 papers using it
First seen: 2026
CaP-Bench is a benchmark that evaluates the performance of frontier language and vision-language models in robot manipulation across varying levels of abstraction, interaction, and perceptual grounding.