← all datasets

ATBench

Emerging

5papers using it

2026first seen

🔎 Find this dataset

Papers using ATBench (5)

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security2026

Autonomous computational catalysis through an agentic research system2026 · 2 cites

Ask Now, Use Later: Benchmarking the Proactivity Gap in Long-Lived LLM Agents2026

Content-Aware Attack Detection in LLM Agent Tool-Call Traffic: An Empirical Study of Features, Architectures, and Evaluation Protocols2026

RAT: RunAnyThing via Fully Automated Environment Configuration2026

ATBench dataset — papers, benchmarks & downloads · AI Agents