Sokoban
Emerging7papers using it
2019first seen
Papers using Sokoban (7)
- Retrospective Progress-Aware Self-Refinement for LLM Agent TrainingTAPE: Tool-guided Adaptive Planning And Constrained Execution In Language Model AgentsHiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM AgentsTSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM AgentsMeta-RL Induces Exploration in Language AgentsAn investigation of model-free planningSolving Sokoban with forward-backward reinforcement learning