Gender Bias in Big Data Analysis

被引:3
作者
Misa, Thomas J.
机构
来源
INFORMATION & CULTURE | 2022年 / 57卷 / 03期
关键词
gender bias; algorithmic bias; big data; history of computing; computer science research; digital humanities; WOMEN; HISTORY;
D O I
10.7560/IC57303
中图分类号
K [历史、地理];
学科分类号
06 ;
摘要
This article combines humanistic "data critique" with informed inspec-tion of big data analysis. It measures gender bias when gender prediction software tools (Gender API, Namsor, and Genderize.io) are used in historical big data research. Gender bias is measured by contrasting personally identified com-puter science authors in the well-regarded DBLP dataset (1950-80) with exactly comparable results from the software tools. Implications for public understanding of gender bias in comput-ing and the nature of the computing profession are outlined. Preliminary assessment of the Semantic Scholar dataset is presented. The conclu-sion combines humanistic approaches with selective use of big data methods.
引用
收藏
页码:283 / 306
页数:25
相关论文
共 87 条
[11]  
Benjamin Ruha., 2019, Race After Technology: Abolitionist Tools for the New Jim Code
[12]  
Berliner M.L., 1950, A SORSBYS HISTOPATHO, V2
[13]  
Bix A.S., 2004, NWSA J, V16, P27, DOI [10.2307/4317033, DOI 10.2307/4317033, DOI 10.2979/NWS.2004.16.1.27]
[14]  
Blanke T, 2018, DIGIT HUMANITIES Q, V12
[15]  
Blevins C, 2015, DIGIT HUMANITIES Q, V9
[16]  
Bolukbasi T, 2016, Arxiv, DOI [arXiv:1607.06520, 10.48550/arXiv.1607.06520, DOI 10.48550/ARXIV.1607.06520]
[17]  
Boulis Ann K., 2008, CHANGING FACE MED WO, P20
[18]  
Broussard M, 2018, ARTIFICIAL UNINTELLIGENCE: HOW COMPUTERS MISUNDERSTAND THE WORLD
[19]   MULTIPLE PRIMARY CARCINOMA [J].
BROWN, A ;
ELKES, AZ .
BRITISH MEDICAL JOURNAL, 1950, 2 (4676) :462-462
[20]  
Buolamwini J., 2018, Proceedings of Machine Learning Research, V81, P77