GPT-OSS-Safeguard-20B
Emerging
1 paper using it
First seen: 2026
GPT-OSS-Safeguard-20B is a benchmark dataset for evaluating how effective adversarial attack algorithms are against large language models, specifically jailbreaking and prompt injection attacks.