Gender Bias in Big Data Analysis

被引:3
作者
Misa, Thomas J.
机构
来源
INFORMATION & CULTURE | 2022年 / 57卷 / 03期
关键词
gender bias; algorithmic bias; big data; history of computing; computer science research; digital humanities; WOMEN; HISTORY;
D O I
10.7560/IC57303
中图分类号
K [历史、地理];
学科分类号
06 ;
摘要
This article combines humanistic "data critique" with informed inspec-tion of big data analysis. It measures gender bias when gender prediction software tools (Gender API, Namsor, and Genderize.io) are used in historical big data research. Gender bias is measured by contrasting personally identified com-puter science authors in the well-regarded DBLP dataset (1950-80) with exactly comparable results from the software tools. Implications for public understanding of gender bias in comput-ing and the nature of the computing profession are outlined. Preliminary assessment of the Semantic Scholar dataset is presented. The conclu-sion combines humanistic approaches with selective use of big data methods.
引用
收藏
页码:283 / 306
页数:25
相关论文
共 87 条
[81]  
Trim M., 2020, ACM Computers Society, V49, P11, DOI DOI 10.1145/3447903.3447908
[82]   Grounding Digital History in the History of Computing [J].
Turkel, William J. ;
Muhammedi, Shezan ;
Start, Mary Beth .
IEEE ANNALS OF THE HISTORY OF COMPUTING, 2014, 36 (02) :72-75
[83]  
Wais K, 2016, R J, V8, P17
[84]   Gender Trends in Computer Science Authorship [J].
Wang, Lucy Lu ;
Stanovsky, Gabriel ;
Weihs, Luca ;
Etzioni, Oren .
COMMUNICATIONS OF THE ACM, 2021, 64 (03) :78-84
[85]   Self-Fulfilling History: How Narrative Shapes Preservation of the Online World [J].
Weber, Marc .
INFORMATION & CULTURE, 2016, 51 (01) :54-80
[86]   The Role of Gender in Scholarly Authorship [J].
West, Jevin D. ;
Jacquet, Jennifer ;
King, Molly M. ;
Correll, Shelley J. ;
Bergstrom, Carl T. .
PLOS ONE, 2013, 8 (07)
[87]   Advance gender prediction tool of first names and its use in analysing gender disparity in Computer Science in the UK, Malaysia and China [J].
Zhao, Hua ;
Kamareddine, Fairouz .
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, :222-227