← all datasets

ScienceWorld

Canonical
18 papers using it
2024 first seen

ScienceWorld is a benchmark dataset for evaluating how well LLM agents orchestrate and execute skills within structured environments.

Papers using ScienceWorld (18)
