An in-depth examination of requirements for disclosure risk assessment

被引:2
|
作者
Jarmin, Ron S. [1 ]
Abowd, John M. [2 ]
Ashmead, Robert [1 ]
Cumings-Menon, Ryan [1 ]
Goldschlag, Nathan [1 ]
Hawes, Michael B. [1 ]
Keller, Sallie Ann [1 ,3 ]
Kifer, Daniel [1 ,4 ]
Leclerc, Philip [1 ]
Reiter, Jerome P. [1 ,5 ]
Rodriguez, Rolando A. [1 ]
Schmutte, Ian [6 ]
Velkoff, Victoria A. [1 ]
Zhuravlev, Pavel [1 ]
机构
[1] US Bur Census, Off Deputy Director, Washington, DC 20233 USA
[2] Cornell Univ, Dept Econ, Ithaca, NY 14853 USA
[3] Univ Virginia, Biocomplex Inst, Charlottesville, VA 22904 USA
[4] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[5] Duke Univ, Dept Stat Sci, Durham, NC 27708 USA
[6] Univ Georgia, Dept Econ, Athens, GA 30602 USA
关键词
federal statistical system; data disclosure risk; data access; DIFFERENTIAL PRIVACY;
D O I
10.1073/pnas.2220558120
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. We argue that any proposal for quantifying disclosure risk should be based on prespecified, objective criteria. We illustrate this approach to evaluate the absolute disclosure risk framework, the counterfactual framework underlying differential privacy, and prior-to-posterior comparisons. We conclude that satisfying all the desiderata is impossible, but counterfactual comparisons satisfy the most while absolute disclosure risk satisfies the fewest. Furthermore, we explain that many of the criticisms levied against differential privacy would be levied against any technology that is not equivalent to direct, unrestricted access to confidential data. More research is needed, but in the near term, the counterfactual approach appears best-suited for privacy versus utility analysis.
引用
收藏
页数:10
相关论文
共 7 条
  • [1] Evaluating the Risk of Disclosure and Utility in a Synthetic Dataset
    Chen, Kang-Cheng
    Yu, Chia-Mu
    Dargahi, Tooska
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (01): : 761 - 787
  • [2] Evaluating the Risk of Data Disclosure Using Noise Estimation for Differential Privacy
    Chen, Hung-Li
    Chen, Jia-Yang
    Tsou, Yao-Tung
    Yu, Chia-Mu
    Tai, Bo-Chen
    Li, Szu-Chuang
    Huang, Yennun
    Lin, Chia-Ming
    2017 IEEE 22ND PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING (PRDC 2017), 2017, : 339 - 347
  • [3] Disclosure Risk from Homogeneity Attack in Differentially Private Release of Frequency Distribution
    Liu, Fang
    Zhao, Xingyuan
    CODASPY'22: PROCEEDINGS OF THE TWELVETH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, 2022, : 343 - 345
  • [4] Disclosure Risk From Homogeneity Attack in Differentially Privately Sanitized Frequency Distribution
    Liu, Fang
    Zhao, Xingyuan
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (05) : 3927 - 3939
  • [5] RoD: Evaluating the Risk of Data Disclosure Using Noise Estimation for Differential Privacy
    Tsou, Yao-Tung
    Chen, Hung-Li
    Chen, Jia-Yang
    IEEE TRANSACTIONS ON BIG DATA, 2021, 7 (01) : 214 - 226
  • [6] Evaluating Identity Disclosure Risk in Fully Synthetic Health Data: Model Development and Validation
    El Emam, Khaled
    Mosquera, Lucy
    Bass, Jason
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (11)
  • [7] Experimental Evaluation for Risk Assessment of Privacy Preserving Synthetic Data
    Chida, Koji
    Kakuta, Susumu
    Itakura, Hiroyuki
    Ishihara, Ichiro
    Yoshioka, Kosuke
    Takeuchi, Hiroshi
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2024, 2024, 14986 : 224 - 236