Gender Bias in Big Data Analysis

被引:3
作者
Misa, Thomas J.
机构
来源
INFORMATION & CULTURE | 2022年 / 57卷 / 03期
关键词
gender bias; algorithmic bias; big data; history of computing; computer science research; digital humanities; WOMEN; HISTORY;
D O I
10.7560/IC57303
中图分类号
K [历史、地理];
学科分类号
06 ;
摘要
This article combines humanistic "data critique" with informed inspec-tion of big data analysis. It measures gender bias when gender prediction software tools (Gender API, Namsor, and Genderize.io) are used in historical big data research. Gender bias is measured by contrasting personally identified com-puter science authors in the well-regarded DBLP dataset (1950-80) with exactly comparable results from the software tools. Implications for public understanding of gender bias in comput-ing and the nature of the computing profession are outlined. Preliminary assessment of the Semantic Scholar dataset is presented. The conclu-sion combines humanistic approaches with selective use of big data methods.
引用
收藏
页码:283 / 306
页数:25
相关论文
共 87 条
[1]  
Abbate Janet., 2012, RECODING GENDER WOME, P3
[2]  
Abbate Thomas, 2021, NEW HIST MODERN COMP, V1, P63
[3]  
Amy E, 2010, SLATONS RACE RIGOR S
[4]  
[Anonymous], 2009, C PRESENTATION 2009, P2
[5]  
[Anonymous], 2012, WOMEN SCI AM
[6]  
[Anonymous], 1956, HUGGETTS REV MEDICAL
[7]   Information History: Searching for Identity [J].
Aspray, William .
INFORMATION & CULTURE, 2019, 54 (01) :69-75
[8]   Quantifying Gender Bias in Different Corpora [J].
Babaeianjelodar, Marzieh ;
Lorenz, Stephen ;
Gordon, Josh ;
Matthews, Jeanna ;
Freitag, Evan .
WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, :752-759
[9]  
Bartscherer Thomas, 2011, SWITCHING CODES THIN, P2
[10]   How to Respond to Data Science: Early Data Criticism by Lionel Trilling [J].
Beaton, Brian .
INFORMATION & CULTURE, 2016, 51 (03) :352-372