BrowseComp-ZH

Emerging
6 papers using it
1,364 HF downloads
7 HF likes
2025 first seen

🧭 BrowseComp-ZH: Benchmarking the Web Browsing Ability of Large Language Models in Chinese

BrowseComp-ZH is the first high-difficulty benchmark specifically designed to evaluate the real-world web browsing and reasoning capabilities of large language models (LLMs) in the Chinese information ecosystem. Inspired by Brow

Papers using BrowseComp-ZH (6)
