The Effect of Population and "Structural" Biases on Social Media-based Algorithms - A Case Study in Geolocation Inference Across the Urban-Rural Spectrum

Cited by: 28
Authors
Johnson, Isaac [1]
McMahon, Connor [2]
Schoening, Johannes [3]
Hecht, Brent [1]
Affiliations
[1] Northwestern Univ, Evanston, IL 60208 USA
[2] Univ Minnesota, GroupLens Res, Minneapolis, MN USA
[3] Univ Bremen, Bremen, Germany
Source
PROCEEDINGS OF THE 2017 ACM SIGCHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'17) | 2017
Funding
National Science Foundation (USA)
Keywords
Algorithmic accountability; geolocation inference; population bias; social media; Twitter
DOI
10.1145/3025453.3026015
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Much research has shown that social media platforms have substantial population biases. However, very little is known about how these population biases affect the many algorithms that rely on social media data. Focusing on the case study of geolocation inference algorithms and their performance across the urban-rural spectrum, we establish that these algorithms exhibit significantly worse performance for underrepresented populations (i.e., rural users). We further establish that this finding is robust across both text- and network-based algorithm designs. However, we also show that some of this bias can be attributed to the design of the algorithms themselves rather than population biases in the underlying data sources. For instance, in some cases, algorithms perform badly for rural users even when we substantially overcorrect for population biases by training exclusively on rural data. We discuss the implications of our findings for the design and study of social media-based algorithms.
Pages: 1167-1178
Number of pages: 12