consensus-KNOWLEDGE
Emerging1papers using it
2026first seen
The 'consensus-KNOWLEDGE' dataset is a comparison set of 388 prompts that distinguishes requests for harmful security knowledge from executable malicious software, evaluated through a consensus protocol involving five large-language-model judges.