A robust privacy preserving approach for electronic health records using multiple dataset with multiple sensitive attributes

被引:19
作者
Kanwal, Tehsin [1 ]
Anjum, Adeel [1 ,5 ]
Malik, Saif U. R. [2 ]
Sajjad, Haider [1 ]
Khan, Abid [3 ]
Manzoor, Umar [4 ]
Asheralieva, Alia [5 ]
机构
[1] Comsats Univ Islamabad, Dept Comp Sci, Islamabad, Pakistan
[2] Cybernetica AS, Tallinn, Estonia
[3] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Dyfed, Wales
[4] Univ Hull, Dept Comp, Kingston Upon Hull, N Humberside, England
[5] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Xueyuan Ave, Shenzhen, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Electronic Health Record; Identity Disclosure; Sensitive Attribute Disclosure; Balanced p(+) sensitive k anonymity model; Formal Verification; Privacy-Preserving; Multiple Sensitive Attributes (MSAs); K-ANONYMITY;
D O I
10.1016/j.cose.2021.102224
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy preserving data publishing of electronic health record (EHRs) for 1 to M datasets with multiple sensitive attributes (MSAs) is an interesting and challenging issue. There is always a trade-off between privacy and utility in data publishing. Most of the privacy-preserving models shows critical privacy disclosure issues and, hence, they are not robust in practical datasets. The k-anonymity model is a broadly used privacy model to analyze privacy disclo-sures, however, this model is only useful against identity disclosure. To address the limita-tions of k-anonymity, a group of privacy model extensions have been proposed in past years. It includes a p-sensitive k-anonymity model, a p + -sensitive k-anonymity model, and a bal-anced p + -sensitive k-anonymity model. However these privacy-preserving models are not sufficient to preserve the privacy of end-users in practical datasets. In this paper we have formalize the behavior of an adversary which perform identity and attribute disclosures on balanced p +-sensitive k-anonymity model with the help of adversarial scenarios. Since balanced p +-sensitive k-anonymity model is not sufficient for 1 to M with MSAs datasets pri-vacy preservation. We propose an extended privacy model called "1: M MSA-(p, l)-diversity" for 1: M dataset with MSAs. We then perform formal modeling and verification of the pro-posed model using High-Level Petri Nets (HLPN) to confirm privacy attacks invalidation. Ex-perimental results show that our proposed "1: M MSA-(p, l)-diversity model" is efficient and provide enhanced data utility of published data. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:21
相关论文
共 46 条
  • [1] Andres Miguel E, 2013, P 2013 ACM SIGSAC C
  • [2] An efficient privacy mechanism for electronic health records
    Anjum, Adeel
    Malik, Saif Ur Rehman
    Choo, Kim-Kwang Raymond
    Khan, Abid
    Haroon, Asma
    Khan, Sangeen
    Khan, Samee U.
    Ahmad, Naveed
    Raza, Basit
    [J]. COMPUTERS & SECURITY, 2018, 72 : 196 - 211
  • [3] Anjum N. Ahmad, 2018, J. Supercomput., V74, P1
  • [4] Ashwin M., 2007, ACM T KNOWL DISCOV D, V1
  • [5] Bhattacharyya Dhruba K, 2012, INT C COMP SCI INF T
  • [6] Campan A, 2010, TRANS DATA PRIV, V3, P65
  • [7] Chen R, 2011, PROC VLDB ENDOW, V4, P1087
  • [8] Dhumal MT., 2015, INT J RECENT INNOV T, V3, P4261
  • [9] Differential privacy: A survey of results
    Dwork, Cynthia
    [J]. THEORY AND APPLICATIONS OF MODELS OF COMPUTATION, PROCEEDINGS, 2008, 4978 : 1 - 19
  • [10] Privacy-Preserving Data Publishing: A Survey of Recent Developments
    Fung, Benjamin C. M.
    Wang, Ke
    Chen, Rui
    Yu, Philip S.
    [J]. ACM COMPUTING SURVEYS, 2010, 42 (04)