AppWorld
Emerging1papers using it
2026first seen
'AppWorld' is a public benchmark consisting of 14 heterogeneous agents used to evaluate skill-conditional trust in agent swarms.
'AppWorld' is a public benchmark consisting of 14 heterogeneous agents used to evaluate skill-conditional trust in agent swarms.