
WideSearch

Emerging
4 papers using it
25,447 HF downloads
42 HF likes
First seen: 2025

WideSearch: Benchmarking Agentic Broad Info-Seeking

Dataset Summary

WideSearch is a benchmark designed to evaluate the capabilities of Large Language Model (LLM) driven agents in broad information-seeking tasks. Unlike existing benchmarks that focus on finding a single, hard-to-find fact, WideSearch assesses an agent's

Papers using WideSearch (4)
