Biases in Large Language Models: Origins, Inventory, and Discussion

被引:106
作者
Navigli, Roberto [1 ]
Conia, Simone [1 ]
Ross, Bjorn [2 ]
机构
[1] Sapienza Univ Rome, Via Ariosto 25, Rome, Italy
[2] Univ Edinburgh, 10 Crichton St, Edinburgh, Midlothian, Scotland
来源
ACM JOURNAL OF DATA AND INFORMATION QUALITY | 2023年 / 15卷 / 02期
关键词
Bias in NLP; language models; HEALTH;
D O I
10.1145/3597307
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we introduce and discuss the pervasive issue of bias in the large language models that are currently at the core of mainstream approaches to Natural Language Processing (NLP). We first introduce data selection bias, that is, the bias caused by the choice of texts that make up a training corpus. Then, we survey the different types of social bias evidenced in the text generated by language models trained on such corpora, ranging from gender to age, from sexual orientation to ethnicity, and from religion to culture. We conclude with directions focused on measuring, reducing, and tackling the aforementioned types of bias.
引用
收藏
页数:21
相关论文
共 132 条
[11]   Should Job Applicants Be Excited or Calm? The Role of Culture and Ideal Affect in Employment Settings [J].
Bencharit, Lucy Zhang ;
Ho, Yuen Wan ;
Fung, Helene H. ;
Yeung, Dannii Y. ;
Stephens, Nicole M. ;
Romero-Canyas, Rainer ;
Tsai, Jeanne L. .
EMOTION, 2019, 19 (03) :377-401
[12]  
Bender E. M., 2020, P 58 ANN M ASS COMP, P5185
[13]   On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? [J].
Bender, Emily M. ;
Gebru, Timnit ;
McMillan-Major, Angelina ;
Shmitchell, Shmargaret .
PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, :610-623
[14]  
Bevilacqua M, 2021, PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, P4330
[15]  
Bevilacqua M, 2021, AAAI CONF ARTIF INTE, V35, P12564
[16]  
Bevilacqua M, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P2854
[17]   The Gender Wage Gap: Extent, Trends, and Explanations [J].
Blau, Francine D. ;
Kahn, Lawrence M. .
JOURNAL OF ECONOMIC LITERATURE, 2017, 55 (03) :789-865
[18]  
Blevins T, 2022, Arxiv, DOI arXiv:2204.08110
[19]  
Blloshmi R, 2021, PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, P3786
[20]   CRITICAL QUESTIONS FOR BIG DATA Provocations for a cultural, technological, and scholarly phenomenon [J].
Boyd, Danah ;
Crawford, Kate .
INFORMATION COMMUNICATION & SOCIETY, 2012, 15 (05) :662-679