LongMemEval-S

Emerging
10 papers using it
4 HF downloads
0 HF likes
2025 first seen

LongMemEval-S is a benchmark dataset used to evaluate how well long-context LLM agents manage persistent state across interactions.

Papers using LongMemEval-S (10)