Protecting patient privacy in survival analyses

被引:16
作者
Bonomi, Luca [1 ]
Jiang, Xiaoqian [2 ]
Ohno-Machado, Lucila [1 ,3 ]
机构
[1] Univ Calif San Diego, UC San Diego Hlth, Dept Biomed Informat, La Jolla, CA 92093 USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, Houston, TX 77030 USA
[3] VA San Diego Healthcare Syst, Div Hlth Serv Res & Dev, La Jolla, CA USA
关键词
data privacy; survival analysis; data sharing; Kaplan-Meier; actuarial; REGRESSION-MODELS; RISK; PREDICTORS; NETWORK;
D O I
10.1093/jamia/ocz195
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Survival analysis is the cornerstone of many healthcare applications in which the "survival" probability (eg, time free from a certain disease, time to death) of a group of patients is computed to guide clinical decisions. It is widely used in biomedical research and healthcare applications. However, frequent sharing of exact survival curves may reveal information about the individual patients, as an adversary may infer the presence of a person of interest as a participant of a study or of a particular group. Therefore, it is imperative to develop methods to protect patient privacy in survival analysis. Materials and Methods: We develop a framework based on the formal model of differential privacy, which provides provable privacy protection against a knowledgeable adversary. We show the performance of privacy-protecting solutions for the widely used Kaplan-Meier nonparametric survival model. Results: We empirically evaluated the usefulness of our privacy-protecting framework and the reduced privacy risk for a popular epidemiology dataset and a synthetic dataset. Results show that our methods significantly reduce the privacy risk when compared with their nonprivate counterparts, while retaining the utility of the survival curves. Discussion: The proposed framework demonstrates the feasibility of conducting privacy-protecting survival analyses. We discuss future research directions to further enhance the usefulness of our proposed solutions in biomedical research applications. Conclusion: The results suggest that our proposed privacy-protection methods provide strong privacy protections while preserving the usefulness of survival analyses.
引用
收藏
页码:366 / 375
页数:10
相关论文
共 65 条
[21]   Flexible survival regression modelling [J].
Cortese, Giuliana ;
Scheike, Thomas H. ;
Martinussen, Torben .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (01) :5-28
[22]  
COX DR, 1972, J R STAT SOC B, V34, P187
[23]  
CUTLER S J, 1958, J Chronic Dis, V8, P699, DOI 10.1016/0021-9681(58)90126-7
[24]   Differential privacy: A survey of results [J].
Dwork, Cynthia .
THEORY AND APPLICATIONS OF MODELS OF COMPUTATION, PROCEEDINGS, 2008, 4978 :1-19
[25]   Calibrating noise to sensitivity in private data analysis [J].
Dwork, Cynthia ;
McSherry, Frank ;
Nissim, Kobbi ;
Smith, Adam .
THEORY OF CRYPTOGRAPHY, PROCEEDINGS, 2006, 3876 :265-284
[26]   Pure Differential Privacy for Rectangle Queries via Private Partitions [J].
Dwork, Cynthia ;
Naor, Moni ;
Reingold, Omer ;
Rothblum, Guy N. .
ADVANCES IN CRYPTOLOGY - ASIACRYPT 2015, PT II, 2015, 9453 :735-751
[27]   The Algorithmic Foundations of Differential Privacy [J].
Dwork, Cynthia ;
Roth, Aaron .
FOUNDATIONS AND TRENDS IN THEORETICAL COMPUTER SCIENCE, 2013, 9 (3-4) :211-406
[28]  
Dwork C, 2010, ACM S THEORY COMPUT, P715
[29]   Lung function tests in patients with idiopathic pulmonary fibrosis - Are they helpful for predicting outcome? [J].
Erbes, R ;
Schaberg, T ;
Loddenkemper, R .
CHEST, 1997, 111 (01) :51-57
[30]   Monitoring Web Browsing Behavior with Differential Privacy [J].
Fan, Liyue ;
Bonomi, Luca ;
Xiong, Li ;
Sunderam, Vaidy .
WWW'14: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, :177-187