A Supermodularity-Based Differential Privacy Preserving Algorithm for Data Anonymization

被引:20
|
作者
Fouad, Mohamed R. [1 ]
Elbassioni, Khaled [2 ]
Bertino, Elisa [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
关键词
Differential privacy; security; risk management; data sharing; data utility; anonymity; scalability;
D O I
10.1109/TKDE.2013.107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Maximizing data usage and minimizing privacy risk are two conflicting goals. Organizations always apply a set of transformations on their data before releasing it. While determining the best set of transformations has been the focus of extensive work in the database community, most of this work suffered from one or both of the following major problems: scalability and privacy guarantee. Differential Privacy provides a theoretical formulation for privacy that ensures that the system essentially behaves the same way regardless of whether any individual is included in the database. In this paper, we address both scalability and privacy risk of data anonymization. We propose a scalable algorithm that meets differential privacy when applying a specific random sampling. The contribution of the paper is two-fold: 1) we propose a personalized anonymization technique based on an aggregate formulation and prove that it can be implemented in polynomial time; and 2) we show that combining the proposed aggregate formulation with specific sampling gives an anonymization algorithm that satisfies differential privacy. Our results rely heavily on exploring the supermodularity properties of the risk function, which allow us to employ techniques from convex optimization. Through experimental studies we compare our proposed algorithm with other anonymization schemes in terms of both time and privacy risk.
引用
收藏
页码:1591 / 1601
页数:11
相关论文
共 50 条
  • [21] An Anonymization Algorithm for (α,β,γ,δ)-Social Network Privacy Considering Data Utility
    Rajaei, Mehri
    Haghjoo, Mostafa S.
    Miyaneh, Eynollah Khanjari
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2015, 21 (02) : 268 - 305
  • [22] Research on differential privacy preserving clustering algorithm based on spark platform
    Meng Q.
    Zhou L.
    Journal of Computers (Taiwan), 2018, 29 (01) : 47 - 62
  • [23] Privacy Preserving Attribute-Focused Anonymization Scheme for Healthcare Data Publishing
    Onesimu, J. Andrew
    Karthikeyan, J.
    Eunice, Jennifer
    Pomplun, Marc
    Hien Dang
    IEEE ACCESS, 2022, 10 : 86979 - 86997
  • [24] A privacy-preserving model based on differential approach for sensitive data in cloud environment
    Singh, Ashutosh Kumar
    Gupta, Rishabh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33127 - 33150
  • [25] Privacy Preserving Big Data Publication On Cloud Using Mondrian Anonymization Techniques and Deep Neural Networks
    Andrew, J.
    Karthikeyan, J.
    Jebastin, Jeffy
    2019 5TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2019, : 722 - 727
  • [26] Privacy preserving and data publication for vehicular trajectories with differential privacy
    Arif, Muhammad
    Chen, Jianer
    Wang, Guojun
    Geman, Oana
    Balas, Valentina Emilia
    MEASUREMENT, 2021, 173
  • [27] Design of a privacy-preserving algorithm for peer-to-peer network based on differential privacy
    Yu J.
    Ingenierie des Systemes d'Information, 2019, 24 (04): : 433 - 437
  • [28] Data Incremental Clustering Algorithm based on Differential Privacy
    Gao, Qing
    Wang, Xiujun
    Gao, Yan
    Tao, Tao
    2023 IEEE 9TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2023,
  • [29] A clustering-based anonymization approach for privacy-preserving in the healthcare cloud
    Abbasi, Afsoon
    Mohammadi, Behnaz
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (01)
  • [30] A differential privacy preserving algorithm for greedy decision tree
    Yang, Shudan
    Li, Nan
    Sun, Daozhu
    Du, Qiming
    Liu, Wenfu
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 229 - 237