← all datasets

OSWorld

Canonical
28papers using it
2024first seen

OSWorld is a benchmark that contains configurations for multi-application environments used to evaluate the performance of computer-use agents (CUAs) in interacting with graphical desktops.

Papers using OSWorld (28)

OSWorld β€” datasets β€” ai-agents