ON ASSESSING THE DISCLOSURE RISK OF CONTROLLED ADJUSTMENT METHODS FOR STATISTICAL TABULAR DATA

被引:8
|
作者
Castro, Jordi [1 ]
机构
[1] Univ Politecn Cataluna, Dept Stat & Operat Res, ES-08034 Barcelona, Catalonia, Spain
关键词
Statistical disclosure control; controlled tabular adjustment; disclosure risk; optimization; linear programming; quadratic programming;
D O I
10.1142/S0218488512400314
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Minimum distance controlled tabular adjustment is a recent perturbative approach for statistical disclosure control in tabular data. Given a table to be protected, it looks for the closest safe table, using some particular distance. Controlled adjustment is known to provide high data utility. However, the disclosure risk has only been partially analyzed using theoretical results from optimization. This work extends these previous results, providing both a more detailed theoretical analysis, and an extensive empirical assessment of the disclosure risk of the method. A set of 25 instances from the literature and four different attacker scenarios are considered, with several random replications for each scenario, both for L-1 and L-2 distances. This amounts to the solution of more than 2000 optimization problems. The analysis of the results shows that the approach has low disclosure risk when the attacker has no good information on the bounds of the optimization problem. On the other hand, when the attacker has good estimates of the bounds, and the only uncertainty is in the objective function (which is a very strong assumption), the disclosure risk of controlled adjustment is high and it should be avoided.
引用
收藏
页码:921 / 941
页数:21
相关论文
共 50 条
  • [31] Assessing the Information Loss of Controlled Adjustment Methods in Two-Way Tables
    Castro, Jordi
    Gonzalez, Jose A.
    PRIVACY IN STATISTICAL DATABASES, PSD 2014, 2014, 8744 : 11 - 23
  • [32] Statistical methods for assessing drug interactions using observational data
    Xu, Qian
    Antimisiaris, Demetra
    Kong, Maiying
    JOURNAL OF APPLIED STATISTICS, 2024, 51 (02) : 298 - 323
  • [33] Survey on methods for tabular data protection in ARGUS
    Giessing, S
    PRIVACY IN STATISTICAL DATABASES, PROCEEDINGS, 2004, 3050 : 1 - 13
  • [34] An Empirical Study of Utility and Disclosure Risk for Tabular Data Synthesis Models: In-Depth Analysis and Interesting Findings
    Park, Dae-Young
    Ko, In-Young
    2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 67 - 74
  • [35] Recent deep learning methods for tabular data
    Hwang, Yejin
    Song, Jongwoo
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2023, 30 (02) : 215 - 226
  • [36] Minimum-distance controlled perturbation methods for large-scale tabular data protection
    Castro, J
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2006, 171 (01) : 39 - 52
  • [37] Differential Privacy and Statistical Disclosure Risk Measures: An Investigation with Binary Synthetic Data
    McClure, David
    Reiter, Jerome P.
    TRANSACTIONS ON DATA PRIVACY, 2012, 5 (03) : 535 - 552
  • [38] Assessing Disclosure Risk for Record Linkage
    Skinner, Chris
    PRIVACY IN STATISTICAL DATABASES, PROCEEDINGS, 2008, 5262 : 166 - 176
  • [39] A tutorial in assessing disclosure risk in microdata
    Taylor, Leslie
    Zhou, Xiao-Hua
    Rise, Peter
    STATISTICS IN MEDICINE, 2018, 37 (25) : 3693 - 3706
  • [40] NormalizeMets: assessing, selecting and implementing statistical methods for normalizing metabolomics data
    Alysha M. De Livera
    Gavriel Olshansky
    Julie A. Simpson
    Darren J. Creek
    Metabolomics, 2018, 14