Bias in Machine Learning: A Literature Review

被引：4

作者：

Mavrogiorgos, Konstantinos ^{[1
]}

Kiourtis, Athanasios ^{[1
]}

Mavrogiorgou, Argyro ^{[1
]}

Menychtas, Andreas ^{[1
]}

Kyriazis, Dimosthenis ^{[1
]}

机构：

[1] Univ Piraeus, Dept Digital Syst, Piraeus 18534, Greece

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 19期

关键词：

bias; algorithms; machine learning; artificial intelligence; literature review; NEURAL-NETWORKS; REGULARIZATION; PERFORMANCE; SELECTION; DROPOUT; MODEL; REGRESSION; FEATURES; LASSO; RIDGE;

D O I：

10.3390/app14198860

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Bias could be defined as the tendency to be in favor or against a person or a group, thus promoting unfairness. In computer science, bias is called algorithmic or artificial intelligence (i.e., AI) and can be described as the tendency to showcase recurrent errors in a computer system, which result in "unfair" outcomes. Bias in the "outside world" and algorithmic bias are interconnected since many types of algorithmic bias originate from external factors. The enormous variety of different types of AI biases that have been identified in diverse domains highlights the need for classifying the said types of AI bias and providing a detailed overview of ways to identify and mitigate them. The different types of algorithmic bias that exist could be divided into categories based on the origin of the bias, since bias can occur during the different stages of the Machine Learning (i.e., ML) lifecycle. This manuscript is a literature study that provides a detailed survey regarding the different categories of bias and the corresponding approaches that have been proposed to identify and mitigate them. This study not only provides ready-to-use algorithms for identifying and mitigating bias, but also enhances the empirical knowledge of ML engineers to identify bias based on the similarity that their use cases have to other approaches that are presented in this manuscript. Based on the findings of this study, it is observed that some types of AI bias are better covered in the literature, both in terms of identification and mitigation, whilst others need to be studied more. The overall contribution of this research work is to provide a useful guideline for the identification and mitigation of bias that can be utilized by ML engineers and everyone who is interested in developing, evaluating and/or utilizing ML models.

引用

页数：40

共 231 条

[11] New machine learning approaches to improve reference evapotranspiration estimates using intra-daily temperature-based variables in a semi-arid region of Spain
Antonio Bellido-Jimenez, Juan
Estevez, Javier
Penelope Garcia-Marin, Amanda
[J]. AGRICULTURAL WATER MANAGEMENT, 2021, 245
[12] Evaluation of Gender Bias in Facial Recognition with Traditional Machine Learning Algorithms
Atay, Mustafa
Gipson, Hailey
Gwyn, Tony
Roy, Kaushik
[J]. 2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
[13] Automated Threat Report Classification Over Multi-Source Data
Ayoade, Gbadebo
Chandra, Swarup
Khan, Latifur
Hamlen, Kevin
Thuraisingham, Bhavani
[J]. 2018 4TH IEEE INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC 2018), 2018, : 236 - 245
[14] ChatGPT: More Human-Like Than Computer-Like, but Not Necessarily in a Good Way
Azaria, Amos
[J]. 2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 468 - 473
[15] Baker M.R., 2023, J. Eng. Res. (Ponta Grossa), DOI 10.1016/j.jer.2023.11.023
[16] Bosco: Boosting Corrections for Genome-Wide Association Studies With Imbalanced Samples
Bao, Feng
Deng, Yue
Zhao, Yanyu
Suo, Jinli
Dai, Qionghai
[J]. IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2017, 16 (01) : 69 - 77
[17] Bareinboim E., 2022, PROBABILISTIC CAUSAL, P433, DOI [10.1145/3501714.3501740?casatoken=QEfNtuGGD4AAAAA:dr0ujPx5fsRntIpzbFofCEI0A2EWtNIhx1b4DVg1BOTzTk2iTbHf2tLS2nYPSXFWYWz0mMBlL5j, DOI 10.1145/3501714.3501740?CASATOKEN=QEFNTUGGD4AAAAA:DR0UJPX5FSRNTIPZBFOFCEI0A2EWTNIHX1B4DVG1BOTZTK2ITBHF2TLS2NYPSXFWYWZ0MMBLL5J]
[18] Big Data's Disparate Impact
Barocas, Solon
Selbst, Andrew D.
[J]. CALIFORNIA LAW REVIEW, 2016, 104 (03) : 671 - 732
[19] Behfar Stefan Kambiz, 2023, 2023 Fifth International Conference on Blockchain Computing and Applications (BCCA), P643, DOI 10.1109/BCCA58897.2023.10338888
[20] Features of residential energy consumption: Evidence from France using an innovative multilevel modelling approach
Belaid, Fateh
Roubaud, David
Galariotis, Emilios
[J]. ENERGY POLICY, 2019, 125 : 277 - 285

← 1 2 3 4 5 6 7 8 9 10 →