Awesome AI Agents
π
Papers
π§
Topics
π₯
Trending
πΊοΈ
Map
π
Leaderboards
π
Learn
π€
Ask AI
β―
More
π₯
Authors
π
Reading Packs
π
Datasets
π οΈ
Tools
π°
News
π
Blogs
βοΈ
Newsletter
π
Saved
+ Add Paper
βΎ
β
β all datasets
AgentBench
Canonical
1
papers using it
152
HF downloads
1
HF likes
2026
first seen
π€ Hugging Face
Papers using AgentBench (1)
Evaluating Agentic AI In The Wild: Failure Modes, Drift Patterns, And A Production Evaluation Framework
2026
π€
Ask AI
AgentBench β datasets β ai-agents