Anonymization of Longitudinal Electronic Medical Records

被引:38
作者
Tamersoy, Acar [1 ]
Loukides, Grigorios [2 ]
Nergiz, Mehmet Ercan [3 ]
Saygin, Yucel [4 ]
Malin, Bradley [1 ]
机构
[1] Vanderbilt Univ, Dept Biomed Informat, Nashville, TN 37232 USA
[2] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 3AA, S Glam, Wales
[3] Zirve Univ, Dept Comp Engn, TR-27260 Gaziantep, Turkey
[4] Sabanci Univ, Dept Comp Sci & Engn, TR-34956 Istanbul, Turkey
来源
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE | 2012年 / 16卷 / 03期
基金
美国国家卫生研究院;
关键词
Anonymization; data privacy; electronic medical records (EMRs); longitudinal data; K-ANONYMITY; IDENTIFICATION; DISCLOSURE; SYSTEMS;
D O I
10.1109/TITB.2012.2185850
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Electronic medical record (EMR) systems have enabled healthcare providers to collect detailed patient information from the primary care domain. At the same time, longitudinal data from EMRs are increasingly combined with biorepositories to generate personalized clinical decision support protocols. Emerging policies encourage investigators to disseminate such data in a deidentified form for reuse and collaboration, but organizations are hesitant to do so because they fear such actions will jeopardize patient privacy. In particular, there are concerns that residual demographic and clinical features could be exploited for reidentification purposes. Various approaches have been developed to anonymize clinical data, but they neglect temporal information and are, thus, insufficient for emerging biomedical research paradigms. This paper proposes a novel approach to share patient-specific longitudinal data that offers robust privacy guarantees, while preserving data utility for many biomedical investigations. Our approach aggregates temporal and diagnostic information using heuristics inspired from sequence alignment and clustering methods. We demonstrate that the proposed approach can generate anonymized data that permit effective biomedical analysis using several patient cohorts derived from the EMR system of the Vanderbilt University Medical Center.
引用
收藏
页码:413 / 423
页数:11
相关论文
共 58 条
  • [11] Cormen T., 2001, Introduction to Algorithms
  • [12] Dalenius Tore, 1986, J. Off. Stat., V2, P329
  • [13] Use of Electronic Medical Records for Health Outcomes Research A Literature Review
    Dean, Bonnie B.
    Lam, Jessica
    Natoli, Jaime L.
    Butler, Qiana
    Aguilar, Daniel
    Nordyke, Robert J.
    [J]. MEDICAL CARE RESEARCH AND REVIEW, 2009, 66 (06) : 611 - 638
  • [14] Identification of Genomic Predictors of Atrioventricular Conduction Using Electronic Medical Records as a Tool for Genome Science
    Denny, Joshua C.
    Ritchie, Marylyn D.
    Crawford, Dana C.
    Schildcrout, Jonathan S.
    Ramirez, Andrea H.
    Pulley, Jill M.
    Basford, Melissa A.
    Masys, Daniel R.
    Haines, Jonathan L.
    Roden, Dan M.
    [J]. CIRCULATION, 2010, 122 (20) : 2016 - 2021
  • [15] A Survey of Confidential Data Storage and Deletion Methods
    Diesburg, Sarah M.
    Wang, An-I Andy
    [J]. ACM COMPUTING SURVEYS, 2010, 43 (01)
  • [16] Ordinal, continuous and heterogeneous k-anonymity through microaggregation
    Domingo-Ferrer, J
    Torra, V
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 11 (02) : 195 - 212
  • [17] Practical data-oriented microaggregation for statistical disclosure control
    Domingo-Ferrer, J
    Mateo-Sanz, JM
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (01) : 189 - 201
  • [18] Protecting privacy using k-anonymity
    El Emam, Khaled
    Dankar, Fida Kamal
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2008, 15 (05) : 627 - 637
  • [19] A Globally Optimal k-Anonymity Method for the De-Identification of Health Data
    El Emam, Khaled
    Dankar, Fida Kamal
    Issa, Romeo
    Jonker, Elizabeth
    Amyot, Daniel
    Cogo, Elise
    Corriveau, Jean-Pierre
    Walker, Mark
    Chowdhury, Sadrul
    Vaillancourt, Regis
    Roffey, Tyson
    Bottomley, Jim
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2009, 16 (05) : 670 - 682
  • [20] Privacy-Preserving Data Publishing: A Survey of Recent Developments
    Fung, Benjamin C. M.
    Wang, Ke
    Chen, Rui
    Yu, Philip S.
    [J]. ACM COMPUTING SURVEYS, 2010, 42 (04)