A Survey on Bias in Deep NLP

被引:70
作者
Garrido-Munoz, Ismael [1 ]
Montejo-Raez, Arturo [1 ]
Martinez-Santiago, Fernando [1 ]
Urena-Lopez, L. Alfonso [1 ]
机构
[1] Ctr Estudios Avanzados TIC CEATIC, Jaen 230071, Spain
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 07期
关键词
natural language processing; deep learning; biased models;
D O I
10.3390/app11073184
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Deep neural networks are hegemonic approaches to many machine learning areas, including natural language processing (NLP). Thanks to the availability of large corpora collections and the capability of deep architectures to shape internal language mechanisms in self-supervised learning processes (also known as "pre-training"), versatile and performing models are released continuously for every new network design. These networks, somehow, learn a probability distribution of words and relations across the training collection used, inheriting the potential flaws, inconsistencies and biases contained in such a collection. As pre-trained models have been found to be very useful approaches to transfer learning, dealing with bias has become a relevant issue in this new scenario. We introduce bias in a formal way and explore how it has been treated in several networks, in terms of detection and correction. In addition, available resources are identified and a strategy to deal with bias in deep NLP is proposed.
引用
收藏
页数:26
相关论文
共 87 条
[71]  
Rudinger Rachel, 2018, P 2018 C N AM CHAPTE, V2, P8, DOI [10.18653/v1/N18-2002, DOI 10.18653/V1/N18-2002, 10.18653/v1/n18-2002]
[72]  
Schneider D.J., 2005, PSYCHOL STEREOTYPING
[73]  
Sheng E, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3407
[74]  
Stanovsky G, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1679
[75]  
Stubbs Michael., 1996, TEXT CORPUS ANAL COM
[76]   What are the Biases in My Word Embedding? [J].
Swinger, Nathaniel ;
De-Arteaga, Maria ;
Heffernan, Neil Thomas ;
Leiserson, Mark D. M. ;
Kalai, Adam Tauman .
AIES '19: PROCEEDINGS OF THE 2019 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, 2019, :305-311
[77]  
Tan YC, 2019, ADV NEUR IN, V32
[78]  
Tolan S, 2019, PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND LAW, ICAIL 2019, P83, DOI 10.1145/3322640.3326705
[79]  
Verma S, 2018, 2018 IEEE/ACM INTERNATIONAL WORKSHOP ON SOFTWARE FAIRNESS (FAIRWARE 2018), P1, DOI [10.1145/3194770.3194776, 10.23919/FAIRWARE.2018.8452913]
[80]  
Vig J, 2019, PROCEEDINGS OF THE 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, (ACL 2019), P37