← all datasets

WMDP

Emerging
2papers using it
25,612HF downloads
27HF likes
2026first seen

Dataset Card for WMDP The Weapons of Mass Destruction Proxy (WMDP) benchmark is a dataset of multiple-choice questions that serve as a proxy measurement of hazardous knowledge in biosecurity, cybersecurity, and chemical security. WMDP serves two roles: first, as an evaluation for hazardous knowledge in LLMs, and second, as a benchmark for unlearning methods to remove such hazardous knowledge. See our paper, website, and GitHub for more details! We implemented the WMDP evaluation in… See the full description on the dataset page: https://huggingface.co/datasets/cais/wmdp.

WMDP β€” datasets β€” graph-learning