Writing Style Matters: An Examination Of Bias And Fairness In Information Retrieval Systems
2024 · Hongliu Cao
Abstract
The rapid advancement of language model technologies has opened new opportunities, but has also introduced new challenges related to bias and fairness. This paper explores potential biases in state-of-the-art universal text embedding models towards specific document and query writing styles within Information Retrieval (IR) systems, an area that has not previously been studied. Our investigation reveals that different embedding models prefer different document writing styles, while most embedding models disfavor more informal and emotive styles. For query writing styles, many embedding models tend to retrieve documents whose style matches that of the query, but some show a consistent preference for specific styles regardless of the query. Text embedding models fine-tuned on synthetic data generated by LLMs display a consistent preference for certain styles of generated data. These biases in text-embedding-based IR systems can inadvertently silence or marginalize certain communication s
Related papers
- Debiasing Gender Bias In Information Retrieval Models (2022) 0.00
- An Empirical Study Of Position Bias In Modern Information Retrieval (2025) 1.69
- Quantifying Positional Biases In Text Embedding Models (2024) 0.00
- Do Neural Ranking Models Intensify Gender Bias? (2020) 12.47
- Invisible Relevance Bias: Text-image Retrieval Models Prefer AI-generated Images (2023) 9.23
- Uni-retrieval: A Multi-style Retrieval Framework For STEM's Education (2025) 3.58
- Mitigating Test-time Bias For Fair Image Retrieval (2023) 0.00
- Formalized Information Needs Improve Large-language-model Relevance Judgments (2026) 0.00