A Supermodularity-Based Differential Privacy Preserving Algorithm for Data Anonymization

被引:20
|
作者
Fouad, Mohamed R. [1 ]
Elbassioni, Khaled [2 ]
Bertino, Elisa [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
关键词
Differential privacy; security; risk management; data sharing; data utility; anonymity; scalability;
D O I
10.1109/TKDE.2013.107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Maximizing data usage and minimizing privacy risk are two conflicting goals. Organizations always apply a set of transformations on their data before releasing it. While determining the best set of transformations has been the focus of extensive work in the database community, most of this work suffered from one or both of the following major problems: scalability and privacy guarantee. Differential Privacy provides a theoretical formulation for privacy that ensures that the system essentially behaves the same way regardless of whether any individual is included in the database. In this paper, we address both scalability and privacy risk of data anonymization. We propose a scalable algorithm that meets differential privacy when applying a specific random sampling. The contribution of the paper is two-fold: 1) we propose a personalized anonymization technique based on an aggregate formulation and prove that it can be implemented in polynomial time; and 2) we show that combining the proposed aggregate formulation with specific sampling gives an anonymization algorithm that satisfies differential privacy. Our results rely heavily on exploring the supermodularity properties of the risk function, which allow us to employ techniques from convex optimization. Through experimental studies we compare our proposed algorithm with other anonymization schemes in terms of both time and privacy risk.
引用
收藏
页码:1591 / 1601
页数:11
相关论文
共 50 条
  • [31] Segment Clustering Based Privacy Preserving Algorithm for Trajectory Data Publishing
    Li Fengyun
    Xue Junchao
    Sun Dawei
    Gao Yanfang
    WIRELESS SENSOR NETWORKS (CWSN 2017), 2018, 812 : 211 - 221
  • [32] A Scalable (α, k)-Anonymization Approach using MapReduce for Privacy Preserving Big Data Publishing
    Mehta, Brijesh B.
    Gupta, Ruchika
    Rao, Udai Pratap
    Muthiyan, Mukesh
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [33] Privacy-Preserving Data Analytics in Internet of Medical Things
    Mudassar, Bakhtawar
    Tahir, Shahzaib
    Khan, Fawad
    Shah, Syed Aziz
    Shah, Syed Ikram
    Abbasi, Qammer Hussain
    FUTURE INTERNET, 2024, 16 (11)
  • [34] SecDM: privacy-preserving data outsourcing framework with differential privacy
    Dagher, Gaby G.
    Fung, Benjamin C. M.
    Mohammed, Noman
    Clark, Jeremy
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (05) : 1923 - 1960
  • [35] A review of preserving privacy in data collected from buildings with differential privacy
    Janghyun, K.
    Barry, H.
    Tianzhen, H.
    Marc, A. P.
    JOURNAL OF BUILDING ENGINEERING, 2022, 56
  • [36] Anonymization Level and Compliance for Differential Privacy: A Systematic Literature Review
    Prokhorenkov, Dmitry
    2022 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2022, : 1119 - 1124
  • [37] A Framework for Privacy-Preserving in IoV Using Federated Learning With Differential Privacy
    Adnan, Muhammad
    Syed, Madiha Haider
    Anjum, Adeel
    Rehman, Semeen
    IEEE ACCESS, 2025, 13 : 13507 - 13521
  • [38] A Frequent Itemsets Data Mining Algorithm Based on Differential Privacy
    Li, Qingpeng
    Zhang, Longjun
    Li, Haoyu
    Sun, Wenjun
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INFORMATION MANAGEMENT AND NETWORK SECURITY, 2016, 47 : 251 - 253
  • [39] Preserving Private Cloud Service Data Based on Hypergraph Anonymization
    Li, Yuechuan
    Li, Yidong
    Zhang, Baopeng
    Shen, Hong
    2013 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2013, : 192 - 197
  • [40] Managing dimensionality in data privacy anonymization
    Hessam Zakerzadeh
    Charu C. Aggarwal
    Ken Barker
    Knowledge and Information Systems, 2016, 49 : 341 - 373