Differential privacy under dependent tuples-the case of genomic privacy

被引:22
作者
Almadhoun, Nour [1 ]
Ayday, Erman [1 ,2 ]
Ulusoy, Ozgur [1 ]
机构
[1] Bilkent Univ, Comp Engn Dept, TR-06800 Ankara, Turkey
[2] Case Western Reserve Univ, Dept Elect Engn & Comp Sci, Cleveland, OH 44106 USA
关键词
ATTITUDES; MEDICINE;
D O I
10.1093/bioinformatics/btz837
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The rapid progress in genome sequencing has led to high availability of genomic data. Studying these data can greatly help answer the key questions about disease associations and our evolution. However, due to growing privacy concerns about the sensitive information of participants, accessing key results and data of genomic studies (such as genome-wide association studies) is restricted to only trusted individuals. On the other hand, paving the way to biomedical breakthroughs and discoveries requires granting open access to genomic datasets. Privacy-preserving mechanisms can be a solution for granting wider access to such data while protecting their owners. In particular, there has been growing interest in applying the concept of differential privacy (DP) while sharing summary statistics about genomic data. DP provides a mathematically rigorous approach to prevent the risk of membership inference while sharing statistical information about a dataset. However, DP does not consider the dependence between tuples in the dataset, which may degrade the privacy guarantees offered by the DP. Results: In this work, focusing on genomic datasets, we show this drawback of the DP and we propose techniques to mitigate it. First, using a real-world genomic dataset, we demonstrate the feasibility of an inference attack on differentially private query results by utilizing the correlations between the entries in the dataset. The results show the scale of vulnerability when we have dependent tuples in the dataset. We show that the adversary can infer sensitive genomic data about a user from the differentially private results of a query by exploiting the correlations between the genomes of family members. Second, we propose a mechanism for privacy-preserving sharing of statistics from genomic datasets to attain privacy guarantees while taking into consideration the dependence between tuples. By evaluating our mechanism on different genomic datasets, we empirically demonstrate that our proposed mechanism can achieve up to 50% better privacy than traditional DP-based solutions.
引用
收藏
页码:1696 / 1703
页数:8
相关论文
共 42 条
  • [21] Blowfish Privacy: Tuning Privacy-Utility Trade-offs using Policies
    He, Xi
    Machanavajjhala, Ashwin
    Ding, Bolin
    [J]. SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1447 - 1458
  • [22] Advantages and limitations of next-generation sequencing technologies: A comparison of electrophoresis and non-electrophoresis methods
    Hert, Daniel G.
    Fredlake, Christopher P.
    Barron, Annelise E.
    [J]. ELECTROPHORESIS, 2008, 29 (23) : 4618 - 4626
  • [23] Resolving Individuals Contributing Trace Amounts of DNA to Highly Complex Mixtures Using High-Density SNP Genotyping Microarrays
    Homer, Nils
    Szelinger, Szabolcs
    Redman, Margot
    Duggan, David
    Tembe, Waibhav
    Muehling, Jill
    Pearson, John V.
    Stephan, Dietrich A.
    Nelson, Stanley F.
    Craig, David W.
    [J]. PLOS GENETICS, 2008, 4 (08)
  • [24] Humbert M, 2013, P 2013 ACM SIGSAC C, P1141, DOI DOI 10.1145/2508859.2516707
  • [25] Relationship between public attitudes toward genomic studies related to medicine and their level of genomic literacy in japan
    Ishiyama, Izumi
    Nagai, Akiko
    Muto, Kaori
    Tamakoshi, Akiko
    Kokado, Minori
    Mimura, Kyoko
    Tanzawa, Tetsuro
    Yamagata, Zentaro
    [J]. AMERICAN JOURNAL OF MEDICAL GENETICS PART A, 2008, 146A (13) : 1696 - 1706
  • [26] Johnson A, 2013, 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), P1079
  • [27] Kifer D, 2011, P ACM SIGMOD INNT C, P193
  • [28] Kifer D., 2012, P 31 ACM SIGMOD SIGA, P77, DOI DOI 10.1145/2213556.2213571
  • [29] Public involvement in pharmacogenomics research: a national survey on public attitudes towards pharmacogenomics research and the willingness to donate DNA samples to a DNA bank in Japan
    Kobayashi, Eriko
    Satoh, Nobunori
    [J]. CELL AND TISSUE BANKING, 2009, 10 (04) : 281 - 291
  • [30] Beyond Consent: Building Trusting Relationships With Diverse Populations in Precision Medicine Research
    Kraft, Stephanie A.
    Cho, Mildred K.
    Gillespie, Katherine
    Halley, Meghan
    Varsava, Nina
    Ormond, Kelly E.
    Luft, Harold S.
    Wilfond, Benjamin S.
    Lee, Sandra Soo-Jin
    [J]. AMERICAN JOURNAL OF BIOETHICS, 2018, 18 (04) : 3 - 20