← all datasets

LifelongAgentBench

Emerging
2papers using it
2025first seen

LifelongAgentBench is a unified benchmark designed to systematically assess the lifelong learning ability of large language model (LLM) agents through skill-grounded, interdependent tasks across three interactive environments: Database, Operating System, and Knowledge Graph.

Papers using LifelongAgentBench (2)

LifelongAgentBench β€” datasets β€” ai-agents