Simulating systematic bias in attributed social networks and its effect on rankings of minority nodes

被引:0
作者
Leonie Neuhäuser
Felix I. Stamm
Florian Lemmerich
Michael T. Schaub
Markus Strohmaier
机构
[1] RWTH Aachen University,
[2] University of Passau,undefined
[3] GESIS – Leibniz Institute for the Social Sciences,undefined
来源
Applied Network Science | / 6卷
关键词
Edge uncertainty; Rankings; Attributed networks; Bias; Social networks;
D O I
暂无
中图分类号
学科分类号
摘要
Network analysis provides powerful tools to learn about a variety of social systems. However, most analyses implicitly assume that the considered relational data is error-free, and reliable and accurately reflects the system to be analysed. Especially if the network consists of multiple groups (e.g., genders, races), this assumption conflicts with a range of systematic biases, measurement errors and other inaccuracies that are well documented in the literature. To investigate the effects of such errors we introduce a framework for simulating systematic bias in attributed networks. Our framework enables us to model erroneous edge observations that are driven by external node attributes or errors arising from the (hidden) network structure itself. We exemplify how systematic inaccuracies distort conclusions drawn from network analyses on the task of minority representations in degree-based rankings. By analysing synthetic and real networks with varying homophily levels and group sizes, we find that the effect of introducing systematic edge errors depends on both the type of edge error and the level of homophily in the system: in heterophilic networks, minority representations in rankings are very sensitive to the type of systematic edge error. In contrast, in homophilic networks we find that minorities are at a disadvantage regardless of the type of error present. We thus conclude that the implications of systematic bias in edge data depend on an interplay between network topology and type of systematic error. This emphasises the need for an error model framework as developed here, which provides a first step towards studying the effects of systematic edge-uncertainty for various network analysis tasks.
引用
收藏
相关论文
共 95 条
[1]  
Almquist ZW(2012)Random errors in egocentric networks Soc Netw 34 493-505
[2]  
Avella-Medina M(2020)Centrality measures for graphons: accounting for uncertainty in networks IEEE Trans Netw Sci Eng 7 520-537
[3]  
Parise F(2007)Partner naming and forgetting: recall of network members Soc Netw 29 279-299
[4]  
Schaub MT(2006)On the robustness of centrality measures under conditions of imperfect data Soc Netw 28 124-136
[5]  
Segarra S(2020)Automated and partly automated contact tracing: a systematic review to inform the control of COVID-19 Lancet Digit Health 2 e607-e621
[6]  
Bell DC(2000)Forgetting in the recall-based elicitation of personal and social networks Soc Netw 22 29-43
[7]  
Belli-McQueen B(1993)Accuracy and reliability of self-reported data in interorganizational networks Soc Netw 15 377-398
[8]  
Haider A(2004)The effects of social networks on employment and inequality Am Econ Rev 94 426-454
[9]  
Borgatti SP(2008)Hierarchical structure and the prediction of missing links in networks Nature 453 98-101
[10]  
Carley KM(2020)Distributions of centrality on networks Games Econ Behav 2020 27-328