← all datasets

HarmBench

Emerging
7 papers using it
8,603 HF downloads
46 HF likes
First seen: 2024

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Paper: HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Data: Dataset
About: In this dataset card, we only use the behavior prompts proposed in HarmBench.
License: MIT
Citation: If you fin

Papers using HarmBench (7)
