โ† all datasets

OSWorld

Canonical
28 papers using it
First seen: 2024

OSWorld is a benchmark of multi-application environment configurations for evaluating how well computer-use agents (CUAs) interact with graphical desktops.

Papers using OSWorld (28)
