A Supermodularity-Based Differential Privacy Preserving Algorithm for Data Anonymization

被引:20
|
作者
Fouad, Mohamed R. [1 ]
Elbassioni, Khaled [2 ]
Bertino, Elisa [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
关键词
Differential privacy; security; risk management; data sharing; data utility; anonymity; scalability;
D O I
10.1109/TKDE.2013.107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Maximizing data usage and minimizing privacy risk are two conflicting goals. Organizations always apply a set of transformations on their data before releasing it. While determining the best set of transformations has been the focus of extensive work in the database community, most of this work suffered from one or both of the following major problems: scalability and privacy guarantee. Differential Privacy provides a theoretical formulation for privacy that ensures that the system essentially behaves the same way regardless of whether any individual is included in the database. In this paper, we address both scalability and privacy risk of data anonymization. We propose a scalable algorithm that meets differential privacy when applying a specific random sampling. The contribution of the paper is two-fold: 1) we propose a personalized anonymization technique based on an aggregate formulation and prove that it can be implemented in polynomial time; and 2) we show that combining the proposed aggregate formulation with specific sampling gives an anonymization algorithm that satisfies differential privacy. Our results rely heavily on exploring the supermodularity properties of the risk function, which allow us to employ techniques from convex optimization. Through experimental studies we compare our proposed algorithm with other anonymization schemes in terms of both time and privacy risk.
引用
收藏
页码:1591 / 1601
页数:11
相关论文
共 50 条
  • [41] Privacy-preserving Anonymization with Restricted Search (PARS) on Social Network Data for Criminal Investigations
    Asif, Waciar
    Ray, Indranil Ghosh
    Tahir, Shahzaib
    Rajarajan, Muttukrishnan
    2018 19TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2018, : 329 - 334
  • [42] From t-closeness to differential privacy and vice versa in data anonymization
    Domingo-Ferrer, Josep
    Soria-Comas, Jordi
    KNOWLEDGE-BASED SYSTEMS, 2015, 74 : 151 - 158
  • [43] Pattern Anonymization: Hybridizing Data Restructure with Feature Set Partitioning for Privacy Preserving in Supervised Learning
    Riyazuddin, M. D.
    Balaram, V. V. S. S. S.
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS, ICCII 2016, 2017, 507 : 603 - 614
  • [44] Individual Differential Privacy: A Utility-Preserving Formulation of Differential Privacy Guarantees
    Soria-Comas, Jordi
    Domingo-Ferrer, Josep
    Sanchez, David
    Megias, David
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (06) : 1418 - 1429
  • [45] An Efficient Big Data Anonymization Algorithm Based on Chaos and Perturbation Techniques
    Eyupoglu, Can
    Aydin, Muhammed Ali
    Zaim, Abdul Halim
    Sertbas, Ahmet
    ENTROPY, 2018, 20 (05)
  • [46] PPMM-DA: Privacy-Preserving Multidimensional and Multisubset Data Aggregation With Differential Privacy for Fog-Based Smart Grids
    Zhao, Shuai
    Xu, Shuhua
    Han, Song
    Ren, Siqi
    Wang, Ye
    Chen, Zhixian
    Chen, Xiaoli
    Lin, Jianhong
    Liu, Weinan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (04): : 6096 - 6110
  • [47] Differential Privacy in Privacy-Preserving Big Data and Learning: Challenge and Opportunity
    Jiang, Honglu
    Gao, Yifeng
    Sarwar, S. M.
    GarzaPerez, Luis
    Robin, Mahmudul
    SILICON VALLEY CYBERSECURITY CONFERENCE, SVCC 2021, 2022, 1536 : 33 - 44
  • [48] Parking recommender system privacy preservation through anonymization and differential privacy
    Saleem, Yasir
    Rehmani, Mubashir Husain
    Crespi, Noel
    Minerva, Roberto
    ENGINEERING REPORTS, 2021, 3 (02)
  • [49] A Framework for Efficient Data Anonymization under Privacy and Accuracy Constraints
    Ghinita, Gabriel
    Karras, Panagiotis
    Kalnis, Panos
    Mamoulis, Nikos
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (02):
  • [50] Privacy-Preserving Bin-Packing With Differential Privacy
    Li, Tianyu
    Erkin, Zekeriya
    Lagendijk, Reginald L.
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2022, 3 : 94 - 106