On the complexity of optimal microaggregation for statistical disclosure control

被引:1
|
作者
Oganian, Anna [1 ]
Domingo-Ferrer, Josep [1 ]
机构
[1] Universitat Rovira i Virgili, Dept. of Comp. Eng. and Mathematics, Av. Països Catalans 26, E-43007 Tarragona, Catalonia, Spain
来源
Statistical Journal of the United Nations Economic Commission for Europe | 2001年 / 18卷 / 04期
关键词
Algorithms - Computational complexity - Data mining - Database systems - Polynomials - Security of data - Statistical methods;
D O I
10.3233/sju-2001-18409
中图分类号
学科分类号
摘要
Statistical disclosure control (SDC), also termed inference control two decades ago, is an integral part of data security dealing with the protection of statistical data. The basic problem in SDC is to release data in a way that does not lead to disclosure of individual information (high security) but preserves the informational content as much as possible (low information loss). SDC is dual with data mining in the sense that progress of data mining techniques forces official statistics to continuously improve SDC techniques: the more powerful the inferences that can be made on a released data set, the more protection is needed so that no inference jeopardizes the privacy of individual respondents' data. This paper deals with the computational complexity of optimal microaggregation, where optimal means yielding minimal information loss for a fixed security level. More specifically, we show that the problem of optimal microaggregation cannot be exactly solved in polynomial time. This result is relevant because it provides theoretical justification for the lack of exact optimal algorithms and for the current use of heuristic approaches.
引用
收藏
页码:345 / 353
相关论文
共 50 条
  • [1] Microaggregation heuristic applied to statistical disclosure control
    Fadel, Augusto César
    Ochi, Luiz Satoru
    Brito, José André de Moura
    Semaan, Gustavo Silva
    Information Sciences, 2021, 548 : 37 - 55
  • [2] Microaggregation heuristic applied to statistical disclosure control
    Fadel, Augusto Cesar
    Ochi, Luiz Satoru
    Moura Brito, Jose Andre de
    Semaan, Gustavo Silva
    INFORMATION SCIENCES, 2021, 548 : 37 - 55
  • [3] Density-based microaggregation for statistical disclosure control
    Lin, Jun-Lin
    Wen, Tsung-Hsien
    Hsieh, Jui-Chien
    Chang, Pei-Chann
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (04) : 3256 - 3263
  • [4] Practical data-oriented microaggregation for statistical disclosure control
    Domingo-Ferrer, J
    Mateo-Sanz, JM
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (01) : 189 - 201
  • [5] Microaggregation Sorting Framework for K-Anonymity Statistical Disclosure Control in Cloud Computing
    Kabir, Md Enamul
    Mahmood, Abdun Naser
    Wang, Hua
    Mustafa, Abdul K.
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (02) : 408 - 417
  • [6] Understanding Microaggregation- A technique of Statistical Disclosure Control for Privacy Preserving and Data Publishing in Inter-Cloud
    Gadad, Veena
    Sowmyarani, C. N.
    2018 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2018,
  • [7] On the disclosure risk of multivariate microaggregation
    Nin, Jordi
    Herranz, Javier
    Torra, Vicenc
    DATA & KNOWLEDGE ENGINEERING, 2008, 67 (03) : 399 - 412
  • [8] Analysis of the Univariate Microaggregation Disclosure Risk
    Jordi Nin
    Vicenç Torra
    New Generation Computing, 2009, 27 : 197 - 214
  • [9] Analysis of the Univariate Microaggregation Disclosure Risk
    Nin, Jordi
    Torra, Vicenc
    NEW GENERATION COMPUTING, 2009, 27 (03) : 197 - 214
  • [10] The Parameterized Complexity of Network Microaggregation
    Blazej, Vaclav
    Ganian, Robert
    Knop, Dusan
    Pokorny, Jan
    Schierreich, Simon
    Simonov, Kirill
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 5, 2023, : 6262 - 6270