HarmBench
Emerging3papers using it
8,603HF downloads
46HF likes
2025first seen
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal Paper: HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal Data: Dataset About In this dataset card, we only use the behavior prompts proposed in HarmBench. License MIT Citation If you fin
π€ Hugging Faceβ mit