
4 safety benchmarks

Emerging
Papers using it: 1
First seen: 2026

'4 safety benchmarks' is a dataset used to evaluate model safety by assessing model outputs against specific safety criteria. It is used to check that speculative decoding does not compromise the safety of generated responses.
