Toward creating a fairer ranking in search engine results

被引:66
|
作者
Gao, Ruoyuan [1 ]
Shah, Chirag [2 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08854 USA
[2] Univ Washington, Informat Sch, Seattle, WA 98195 USA
关键词
Information retrieval; Search engine bias; Fairness ranking; Relevance; Diversity; Novelty; BIAS; IMPACT;
D O I
10.1016/j.ipm.2019.102138
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increasing popularity and social influence of search engines in IR, various studies have raised concerns on the presence of bias in search engines and the social responsibilities of IR systems. As an essential component of search engine, ranking is a crucial mechanism in presenting the search results or recommending items in a fair fashion. In this article, we focus on the top-k diversity fairness ranking in terms of statistical parity fairness and disparate impact fairness. The former fairness definition provides a balanced overview of search results where the number of documents from different groups are equal; The latter enables a realistic overview where the proportion of documents from different groups reflect the overall proportion. Using 100 queries and top 100 results per query from Google as the data, we first demonstrate how topical diversity bias is present in the top web search results. Then, with our proposed entropy-based metrics for measuring the degree of bias, we reveal that the top search results are unbalanced and disproportionate to their overall diversity distribution. We explore several fairness ranking strategies to investigate the relationship between fairness, diversity, novelty and relevance. Our experimental results show that using a variant of fair epsilon-greedy strategy, we could bring more fairness and enhance diversity in search results without a cost of relevance. In fact, we can improve the relevance and diversity by introducing the diversity fairness. Additional experiments with TREC datasets containing 50 queries demonstrate the robustness of our proposed strategies and our findings on the impact of fairness. We present a series of correlation analysis on the amount of fairness and diversity, showing that statistical parity fairness highly correlates with diversity while disparate impact fairness does not. This provides clear and tangible implications for future works where one would want to balance fairness, diversity and relevance in search results.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Web Search Engine Results Page Viewing Formats for Different Search Tasks
    Taieb-Maimon, Meirav
    Harush, Hadas
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024,
  • [22] A fuzzy ranking approach for improving search results in Turkish as an agglutinative language
    Uzun, Erdinc
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (05) : 5658 - 5664
  • [23] Re-ranking search results using an additional retrieved list
    Meister, Lior
    Kurland, Oren
    Kalmanovich, Inna Gelfer
    INFORMATION RETRIEVAL, 2011, 14 (04): : 413 - 437
  • [24] Future-proofing Search Engine Marketing: An Empirical Investigation of Effects of Search Engine Results on Consumer Purchase Decisions
    Diwanji, Vaibhav Shwetangbhai
    Lee, Jaejin
    Cortese, Juliann
    JOURNAL OF STRATEGIC MARKETING, 2023,
  • [25] Search Engine Gender Bias
    Wijnhoven, Fons
    van Haren, Jeanna
    FRONTIERS IN BIG DATA, 2021, 4
  • [26] A Study of Optimizing Search Engine Results Through User Interaction
    Chen, Lin-Chih
    IEEE ACCESS, 2020, 8 : 79024 - 79045
  • [27] Learning to Rank for Search Results Re-ranking in Learning Experience Platforms
    Kataria, Ayush
    Venkateshprasanna, H. M.
    Kummetha, Ashok Kumar Reddy
    PROCEEDINGS OF THE 16TH ANNUAL ACM INDIA COMPUTE CONFERENCE, COMPUTE 2023, 2023, : 25 - 30
  • [28] An ontology-based approach for semantics ranking of the web search engines results
    Bouramoul, Abdelkrim
    Kholladi, Mohamed-Khireddine
    Doan, Bich-Lien
    2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 797 - 802
  • [29] Metasearch engines and information retrieval: Computational complexity of ranking multiple search results
    Goldberg, Robert
    Taksa, Isak
    Spink, Amanda
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, 2008, : 315 - +
  • [30] Search Engine Ranking, Quality, and Content of Web Pages That Are Critical Versus Noncritical of Human Papillomavirus Vaccine
    Fu, Linda Y.
    Zook, Kathleen
    Spoehr-Labutta, Zachary
    Hu, Pamela
    Joseph, Jill G.
    JOURNAL OF ADOLESCENT HEALTH, 2016, 58 (01) : 33 - 39