LifelongAgentBench
Emerging2papers using it
2025first seen
LifelongAgentBench is a unified benchmark designed to systematically assess the lifelong learning ability of large language model (LLM) agents through skill-grounded, interdependent tasks across three interactive environments: Database, Operating System, and Knowledge Graph.