← all datasets

AgentDojo

Emerging
7papers using it
2026first seen

AgentDojo is a benchmark dataset used to evaluate the performance and security of tool-using large language model agents in various scenarios.

Papers using AgentDojo (7)

AgentDojo β€” datasets β€” ai-agents