StereoSet
Emerging
7 papers using it
3,326 HF downloads
30 HF likes
First seen: 2024
Dataset Card for StereoSet

Dataset Summary
StereoSet is a dataset that measures stereotype bias in language models. It consists of 17,000 sentences that measure model preferences across gender, race, religion, and profession.

Supported Tasks and Leaderboards
- multiple-choice question answering

Languages
English
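As a rough illustration of what "measuring model preferences" can mean in practice, the sketch below computes how often a model scores a stereotypical sentence above its anti-stereotypical counterpart. The `SentencePair` type, the `stereotype_preference_rate` function, and the numeric scores are all hypothetical placeholders, not part of the dataset or any library; in a real evaluation the scores would come from a language model (for example, sentence log-likelihoods).

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class SentencePair:
    """Hypothetical container: model scores for one StereoSet-style item."""
    stereotype_score: float       # model score for the stereotypical sentence
    anti_stereotype_score: float  # model score for the anti-stereotypical sentence


def stereotype_preference_rate(pairs: Sequence[SentencePair]) -> float:
    """Return the percentage of pairs where the stereotype scores strictly higher."""
    if not pairs:
        raise ValueError("need at least one sentence pair")
    preferred = sum(
        1 for p in pairs if p.stereotype_score > p.anti_stereotype_score
    )
    return 100.0 * preferred / len(pairs)


# Made-up scores for illustration only; they are not real model outputs.
example_pairs = [
    SentencePair(0.9, 0.1),
    SentencePair(0.2, 0.8),
    SentencePair(0.6, 0.4),
    SentencePair(0.7, 0.3),
]
rate = stereotype_preference_rate(example_pairs)
print(f"stereotype preferred in {rate:.1f}% of pairs")
```

Under this assumed setup, a rate near 50% would suggest no systematic preference either way, while values well above 50% would suggest the model tends to favor stereotypical sentences.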
Hugging Face · cc-by-sa-4.0
Papers using StereoSet (7)
- A Comprehensive Study of Implicit and Explicit Biases in Large Language Models
- No Free Lunch in Language Model Bias Mitigation? Targeted Bias Reduction Can Exacerbate Unmitigated LLM Biases
- Addressing Stereotypes in Large Language Models: A Critical Examination and Mitigation
- Open-DeBias: Toward Mitigating Open-Set Bias in Language Models
- Rethinking Prompt-based Debiasing in Large Language Models
- Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs
- Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework