AgentDyn
Emerging, 1 paper using it
First seen 2026
AgentDyn is a manually designed benchmark of 60 dynamic, open-ended tasks for evaluating how vulnerable real-world AI agent security systems are to prompt injection attacks.