Awesome AI Agents
π
Papers
π§
Topics
π₯
Trending
πΊοΈ
Map
π
Leaderboards
π
Learn
π€
Ask AI
β―
More
π₯
Authors
π
Reading Packs
π
Datasets
π οΈ
Tools
π°
News
π
Blogs
βοΈ
Newsletter
π
Saved
+ Add Paper
βΎ
β
β all datasets
BIG-bench
Emerging
1
papers using it
2026
first seen
Papers using BIG-bench (1)
Evaluating Agentic AI In The Wild: Failure Modes, Drift Patterns, And A Production Evaluation Framework
2026
π€
Ask AI
BIG-bench β datasets β ai-agents