← all datasets

AgentDyn

Emerging
1papers using it
2026first seen

AgentDyn is a manually designed benchmark containing 60 dynamic open-ended tasks used to evaluate the vulnerability of real-world AI agent security systems to prompt injection attacks.

Papers using AgentDyn (1)

AgentDyn β€” datasets β€” cybersecurity