Differentially private genome data dissemination through top-down specialization

被引:23
作者
Wang, Shuang [1 ]
Mohammed, Noman [2 ]
Chen, Rui [3 ]
机构
[1] Univ Calif San Diego, Div Biomed Informat, San Diego, CA 92093 USA
[2] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
[3] Hong Kong Baptist Univ, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
ANONYMITY;
D O I
10.1186/1472-6947-14-S1-S2
中图分类号
R-058 [];
学科分类号
摘要
Advanced sequencing techniques make large genome data available at an unprecedented speed and reduced cost. Genome data sharing has the potential to facilitate significant medical breakthroughs. However, privacy concerns have impeded efficient genome data sharing. In this paper, we present a novel approach for disseminating genomic data while satisfying differential privacy. The proposed algorithm splits raw genome sequences into blocks, subdivides the blocks in a top-down fashion, and finally adds noise to counts to preserve privacy. The experimental results suggest that the proposed algorithm can retain certain data utility in terms of a high sensitivity.
引用
收藏
页数:7
相关论文
共 14 条
[1]  
[Anonymous], 2012, PRIVACY PROGR WHOLE
[2]  
[Anonymous], 2009, Privacy integrated queries: an extensible platform for privacy-preserving data analysis
[3]   Calibrating noise to sensitivity in private data analysis [J].
Dwork, Cynthia ;
McSherry, Frank ;
Nissim, Kobbi ;
Smith, Adam .
THEORY OF CRYPTOGRAPHY, PROCEEDINGS, 2006, 3876 :265-284
[4]   The Mastermind Attack on Genomic Data [J].
Goodrich, Michael T. .
PROCEEDINGS OF THE 2009 30TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY, 2009, :204-218
[5]   Identifying Personal Genomes by Surname Inference [J].
Gymrek, Melissa ;
McGuire, Amy L. ;
Golan, David ;
Halperin, Eran ;
Erlich, Yaniv .
SCIENCE, 2013, 339 (6117) :321-324
[6]   Resolving Individuals Contributing Trace Amounts of DNA to Highly Complex Mixtures Using High-Density SNP Genotyping Microarrays [J].
Homer, Nils ;
Szelinger, Szabolcs ;
Redman, Margot ;
Duggan, David ;
Tembe, Waibhav ;
Muehling, Jill ;
Pearson, John V. ;
Stephan, Dietrich A. ;
Nelson, Stanley F. ;
Craig, David W. .
PLOS GENETICS, 2008, 4 (08)
[7]   How (not) to protect genomic data privacy in a distributed network: using trail re-identification to evaluate and design anonymity protection systems [J].
Malin, B ;
Sweeney, L .
JOURNAL OF BIOMEDICAL INFORMATICS, 2004, 37 (03) :179-192
[8]   An evaluation of the current state of genomic data privacy protection technology and a roadmap for the future [J].
Malin, BA .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2005, 12 (01) :28-34
[9]   Never too old for anonymity: a statistical standard for demographic data sharing via the HIPAA Privacy Rule [J].
Malin, Bradley ;
Benitez, Kathleen ;
Masys, Daniel .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (01) :3-10
[10]   The Complexities of Genomic Identifiability [J].
Rodriguez, Laura L. ;
Brooks, Lisa D. ;
Greenberg, Judith H. ;
Green, Eric D. .
SCIENCE, 2013, 339 (6117) :275-276