Implications of Data Anonymization on the Statistical Evidence of Disparity

被引:7
|
作者
Xu, Heng [1 ]
Zhang, Nan [1 ]
机构
[1] Amer Univ, Kogod Sch Business, Washington, DC 20016 USA
基金
美国国家科学基金会;
关键词
privacy; data anonymization; discrimination; statistical disparity; DIFFERENTIAL PRIVACY; HEALTH DISPARITIES; K-ANONYMITY; BIAS; DISCRIMINATION; PROTECTION; ACCURACY; SECURITY; PROOF;
D O I
10.1287/mnsc.2021.4028
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Research and practical development of data-anonymization techniques have proliferated in recent years. Yet, limited attention has been paid to examine the potentially disparate impact of privacy protection on underprivileged subpopulations. This study is one of the first attempts to examine the extent to which data anonymization could mask the gross statistical disparities between subpopulations in the data. We first describe two common mechanisms of data anonymization and two prevalent types of statistical evidence for disparity. Then, we develop conceptual foundation and mathematical formalism demonstrating that the two data-anonymization mechanisms have distinctive impacts on the identifiability of disparity, which also varies based on its statistical operationalization. After validating our findings with empirical evidence, we discuss the business and policy implications, highlighting the need for firms and policy makers to balance between the protection of privacy and the recognition/rectification of disparate impact.
引用
收藏
页码:2600 / 2618
页数:20
相关论文
共 50 条
  • [21] Anonymization of Data Sets with NULL Values
    Ciglic, Margareta
    Eder, Johann
    Koncilia, Christian
    TRANSACTIONS ON LARGE-SCALE DATA- AND KNOWLEDGE-CENTERED SYSTEMS XXIV, 2016, 9510 : 193 - 220
  • [22] Data anonymization evaluation for big data and IoT environment
    Ni, Chunchun
    Cang, Li Shan
    Gope, Prosanta
    Min, Geyong
    INFORMATION SCIENCES, 2022, 605 : 381 - 392
  • [23] Managing dimensionality in data privacy anonymization
    Zakerzadeh, Hessam
    Aggarwal, Charu C.
    Barker, Ken
    KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (01) : 341 - 373
  • [24] SUHDSA: Secure, Useful, and High-Performance Data Stream Anonymization
    Joo, Yongwan
    Kim, Soonseok
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 9336 - 9347
  • [25] Spectral Anonymization of Data
    Lasko, Thomas A.
    Vinterbo, Staal A.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (03) : 437 - 446
  • [26] Patterns of Data Anonymization
    Monteiro, Mariana
    Correia, Filipe
    Queiroz, Paulo
    Ramos, Rui
    Trigo, Dinis
    Goncalves, Goncalo
    PROCEEDINGS OF THE 29TH EUROPEAN CONFERENCE ON PATTERN LANGUAGES OF PROGRAMS, PEOPLE, AND PRACTICES, EUROPLOP 2024, 2024,
  • [27] Distributed Data Anonymization
    SheikhAlishahi, Mina
    Martinelli, Fabio
    IEEE 17TH INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP / IEEE 17TH INT CONF ON PERVAS INTELLIGENCE AND COMP / IEEE 5TH INT CONF ON CLOUD AND BIG DATA COMP / IEEE 4TH CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2019, : 580 - 586
  • [28] Information based data anonymization for classification utility
    Li, Jiuyong
    Liu, Jixue
    Baig, Muzammil
    Wong, Raymond Chi-Wing
    DATA & KNOWLEDGE ENGINEERING, 2011, 70 (12) : 1030 - 1045
  • [29] Smart Anonymity: a mechanism for recommending data anonymization algorithms based on data profiles for IoT environments
    Neves, Flavio
    Souza, Rafael
    Lima, Wesley
    Raul, Wellison
    Bonfim, Michel
    Garcia, Vinicius
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (14) : 20956 - 21000
  • [30] Modeling and Integrating Background Knowledge in Data Anonymization
    Li, Tiancheng
    Li, Ninghui
    Zhang, Jian
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 6 - +