CaP-Bench

Emerging

2papers using it

2026first seen

CaP-Bench is a benchmark that evaluates the performance of frontier language and vision-language models in robot manipulation across varying levels of abstraction, interaction, and perceptual grounding.

🔎 Find this dataset

Papers using CaP-Bench (2)

Cap-x: A Framework For Benchmarking And Improving Coding Agents For Robot Manipulation2026

CaP-X: A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation2026