← all datasets

AgentDojo

Emerging

9papers using it

2026first seen

AgentDojo is a benchmark dataset used to evaluate the performance and security of tool-using large language model agents in various scenarios.

🔎 Find this dataset

Papers using AgentDojo (6)

Adaptive Evaluation of Out-of-Band Defenses Against Prompt Injection in LLM Agents2026

Beyond Attack-Success Rate: Action-Graded Severity Scale for Tool-Using AI Agents2026

SecureClaw: Clawing Back Control of LLM Agents2026

IterInject: Indirect Prompt Injection Against LLM Agents via Feedback-Guided Iterative Optimization2026

Agentrim: Tool Risk Mitigation For Agentic AI2026

Optimizing Agent Planning for Security and Autonomy2026

AgentDojo dataset — papers, benchmarks & downloads · AI Agents