StereoSet

Name: StereoSet
License: cc-by-sa-4.0

Emerging

7 papers using it

3,326 HF downloads

30 HF likes

First seen: 2024

Dataset Card for StereoSet

Dataset Summary

StereoSet is a dataset that measures stereotype bias in language models. It consists of 17,000 sentences that measure model preferences across gender, race, religion, and profession.

Supported Tasks and Leaderboards

Multiple-choice question answering

Languages

English
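To make "model preferences" in the summary concrete, here is a minimal sketch of how a per-domain preference rate could be computed. It assumes each item already has a model score (for example, a log-likelihood) for a stereotypical and an anti-stereotypical option. The record layout, field names, and numbers are hypothetical illustrations, not the dataset's actual schema or any published result.

```python
from collections import defaultdict


def stereotype_preference(records):
    """Return, per domain, the percentage of items where the model
    scores the stereotypical option above the anti-stereotypical one."""
    wins = defaultdict(int)
    totals = defaultdict(int)
    for record in records:
        domain = record["domain"]
        totals[domain] += 1
        if record["stereotype_score"] > record["anti_stereotype_score"]:
            wins[domain] += 1
    return {domain: 100.0 * wins[domain] / totals[domain] for domain in totals}


# Hypothetical model scores (higher = preferred); invented for illustration only.
records = [
    {"domain": "gender", "stereotype_score": -4.1, "anti_stereotype_score": -5.0},
    {"domain": "gender", "stereotype_score": -6.2, "anti_stereotype_score": -5.9},
    {"domain": "profession", "stereotype_score": -3.0, "anti_stereotype_score": -3.5},
]

print(stereotype_preference(records))
```

Under this sketch, a rate near 50% for a domain would suggest no systematic preference for either option, while a rate well above 50% would suggest the model tends to favor the stereotypical one.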

🤗 Hugging Face · ⚖ cc-by-sa-4.0

Papers using StereoSet (7)

A Comprehensive Study of Implicit and Explicit Biases in Large Language Models · 2025

No Free Lunch in Language Model Bias Mitigation? Targeted Bias Reduction Can Exacerbate Unmitigated LLM Biases · 2025

Addressing Stereotypes in Large Language Models: A Critical Examination and Mitigation · 2025

Open-DeBias: Toward Mitigating Open-Set Bias in Language Models · 2025

Rethinking Prompt-based Debiasing in Large Language Models · 2025

Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs · 2025

Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework · 2024 · 1 citation