ATBench
Emerging6papers using it
2025first seen
ATBench is a benchmark used to evaluate the performance of attack detection frameworks on tool-call traffic from LLM agents, focusing on the effectiveness of various architectures and features in classifying sessions as benign or attacked.
Papers using ATBench (6)
- AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and SecurityAsk Now, Use Later: Benchmarking the Proactivity Gap in Long-Lived LLM AgentsAutonomous computational catalysis through an agentic research systemContent-Aware Attack Detection in LLM Agent Tool-Call Traffic: An Empirical Study of Features, Architectures, and Evaluation ProtocolsRAT: RunAnyThing via Fully Automated Environment ConfigurationAptbench: Benchmarking Agentic Potential Of Base Llms During Pre-training