WMDP
Emerging2papers using it
25,612HF downloads
27HF likes
2026first seen
Dataset Card for WMDP The Weapons of Mass Destruction Proxy (WMDP) benchmark is a dataset of multiple-choice questions that serve as a proxy measurement of hazardous knowledge in biosecurity, cybersecurity, and chemical security. WMDP serves two roles: first, as an evaluation for hazardous knowledge in LLMs, and second, as a benchmark for unlearning methods to remove such hazardous knowledge. See our paper, website, and GitHub for more details! We implemented the WMDP evaluation in⦠See the full description on the dataset page: https://huggingface.co/datasets/cais/wmdp.
π€ Hugging Faceβ mit