← all datasets

BeaverTails

Emerging
1papers using it
2026first seen

The 'BeaverTails' dataset is a benchmark used to evaluate the effectiveness of defenses against adversarial attacks on open-weight large language models (LLMs).

Papers using BeaverTails (1)

BeaverTails β€” datasets β€” cybersecurity