BrowseComp-ZH
Emerging: 6 papers using it
1,364 HF downloads
7 HF likes
First seen: 2025
BrowseComp-ZH: Benchmarking the Web Browsing Ability of Large Language Models in Chinese
BrowseComp-ZH is the first high-difficulty benchmark specifically designed to evaluate the real-world web browsing and reasoning capabilities of large language models (LLMs) in the Chinese information ecosystem. Inspired by Brow
Hugging Face, license: apache-2.0
Papers using BrowseComp-ZH (6)
- OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
- OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
- SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research
- MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks
- BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese
- ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization