LifelongAgentBench

Emerging

2papers using it

2025first seen

LifelongAgentBench is a unified benchmark designed to systematically assess the lifelong learning ability of large language model (LLM) agents through skill-grounded, interdependent tasks across three interactive environments: Database, Operating System, and Knowledge Graph.

🔎 Find this dataset

Papers using LifelongAgentBench (2)

Learning While Acting: A Skill-Enhanced Test-Time Co-Evolution Framework for Online Lifelong Learning Agents2026

LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners2025