← all datasets

AppWorld

Emerging
1papers using it
2026first seen

'AppWorld' is a public benchmark consisting of 14 heterogeneous agents used to evaluate skill-conditional trust in agent swarms.

AppWorld β€” datasets β€” computer-vision