Bias in Machine Learning: A Literature Review

Cited by: 4
Authors
Mavrogiorgos, Konstantinos [1]
Kiourtis, Athanasios [1]
Mavrogiorgou, Argyro [1]
Menychtas, Andreas [1]
Kyriazis, Dimosthenis [1]
Affiliation
[1] Univ Piraeus, Dept Digital Syst, Piraeus 18534, Greece
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 19
Keywords
bias; algorithms; machine learning; artificial intelligence; literature review; NEURAL-NETWORKS; REGULARIZATION; PERFORMANCE; SELECTION; DROPOUT; MODEL; REGRESSION; FEATURES; LASSO; RIDGE
DOI
10.3390/app14198860
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Bias can be defined as the tendency to be in favor of or against a person or a group, thus promoting unfairness. In computer science, bias is called algorithmic or artificial intelligence (AI) bias and can be described as the tendency of a computer system to exhibit recurrent errors that result in "unfair" outcomes. Bias in the "outside world" and algorithmic bias are interconnected, since many types of algorithmic bias originate from external factors. The enormous variety of AI biases that have been identified in diverse domains highlights the need to classify these types of bias and to provide a detailed overview of ways to identify and mitigate them. The types of algorithmic bias can be divided into categories based on their origin, since bias can occur during different stages of the machine learning (ML) lifecycle. This manuscript is a literature study that provides a detailed survey of the different categories of bias and the corresponding approaches that have been proposed to identify and mitigate them. The study not only provides ready-to-use algorithms for identifying and mitigating bias, but also enhances the empirical knowledge of ML engineers, helping them identify bias based on the similarity of their use cases to the approaches presented in this manuscript. Based on the findings of this study, some types of AI bias are better covered in the literature, in terms of both identification and mitigation, whilst others need to be studied more. The overall contribution of this research work is a useful guideline for the identification and mitigation of bias that can be utilized by ML engineers and by everyone who is interested in developing, evaluating, and/or utilizing ML models.
Pages: 40
Related papers
231 in total
  • [11] New machine learning approaches to improve reference evapotranspiration estimates using intra-daily temperature-based variables in a semi-arid region of Spain
    Antonio Bellido-Jimenez, Juan
    Estevez, Javier
    Penelope Garcia-Marin, Amanda
    [J]. AGRICULTURAL WATER MANAGEMENT, 2021, 245
  • [12] Evaluation of Gender Bias in Facial Recognition with Traditional Machine Learning Algorithms
    Atay, Mustafa
    Gipson, Hailey
    Gwyn, Tony
    Roy, Kaushik
    [J]. 2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021
  • [13] Automated Threat Report Classification Over Multi-Source Data
    Ayoade, Gbadebo
    Chandra, Swarup
    Khan, Latifur
    Hamlen, Kevin
    Thuraisingham, Bhavani
    [J]. 2018 4TH IEEE INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC 2018), 2018, : 236 - 245
  • [14] ChatGPT: More Human-Like Than Computer-Like, but Not Necessarily in a Good Way
    Azaria, Amos
    [J]. 2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 468 - 473
  • [15] Baker M.R., 2023, J. Eng. Res. (Ponta Grossa), DOI 10.1016/j.jer.2023.11.023
  • [16] Bosco: Boosting Corrections for Genome-Wide Association Studies With Imbalanced Samples
    Bao, Feng
    Deng, Yue
    Zhao, Yanyu
    Suo, Jinli
    Dai, Qionghai
    [J]. IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2017, 16 (01) : 69 - 77
  • [17] Bareinboim E., 2022, PROBABILISTIC CAUSAL, P433, DOI 10.1145/3501714.3501740
  • [18] Big Data's Disparate Impact
    Barocas, Solon
    Selbst, Andrew D.
    [J]. CALIFORNIA LAW REVIEW, 2016, 104 (03) : 671 - 732
  • [19] Behfar Stefan Kambiz, 2023, 2023 Fifth International Conference on Blockchain Computing and Applications (BCCA), P643, DOI 10.1109/BCCA58897.2023.10338888
  • [20] Features of residential energy consumption: Evidence from France using an innovative multilevel modelling approach
    Belaid, Fateh
    Roubaud, David
    Galariotis, Emilios
    [J]. ENERGY POLICY, 2019, 125 : 277 - 285