← all datasets

AppWorld

Emerging
15papers using it
2025first seen

The 'AppWorld' dataset/benchmark contains a collection of applications and their associated environments, used to evaluate the performance of language agents in understanding and executing user instructions within complex contexts.

Papers using AppWorld (15)

AppWorld β€” datasets β€” ai-agents