← all datasets

WMDP

Emerging
9 papers using it
25,671 HF downloads
27 HF likes
First seen: 2024

Dataset Card for WMDP

The Weapons of Mass Destruction Proxy (WMDP) benchmark is a dataset of multiple-choice questions that serve as a proxy measurement of hazardous knowledge in biosecurity, cybersecurity, and chemical security. WMDP serves two roles: first, as an evaluation for hazardous knowledge in LLMs, and second

Papers using WMDP (9)
