← all datasets

WideSearch

Emerging
2papers using it
36,834HF downloads
42HF likes
2026first seen

WideSearch: Benchmarking Agentic Broad Info-Seeking Dataset Summary WideSearch is a benchmark designed to evaluate the capabilities of Large Language Model (LLM) driven agents in broad information-seeking tasks. Unlike existing benchmarks that focus on finding a single, hard-to-find fact, WideSearch assesses an agent's

Papers using WideSearch (2)

WideSearch β€” datasets β€” ai-agents