Sokoban

Emerging

7papers using it

2019first seen

Sokoban is a benchmark dataset used to evaluate the performance of agents in solving spatial reasoning and planning tasks involving pushing boxes to designated locations within a grid-based environment.

🔎 Find this dataset

Papers using Sokoban (5)

Retrospective Progress-Aware Self-Refinement for LLM Agent Training2026

TAPE: Tool-guided Adaptive Planning And Constrained Execution In Language Model Agents2026

HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents2026

TSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM Agents2026

Meta-RL Induces Exploration in Language Agents2025