xbench-DeepSearch
Emerging4papers using it
2025first seen
The 'xbench-DeepSearch' dataset/benchmark contains synthesized samples used to evaluate the performance of search agents, particularly in the context of complex, multi-hop reasoning tasks.
Papers using xbench-DeepSearch (4)
- OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training DataMiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research TasksWebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling
Info-Rich SeekingMulti-Agent Deep Research: Training Multi-Agent Systems with M-GRPO