On inferring presence of an individual in a mixture: a Bayesian approach

Cited by: 19
Authors
Clayton, David [1, 2]
Affiliations
[1] Univ Cambridge, Wellcome Trust Juvenile Diabet Res Fdn, Addenbrookes Hosp, Diabet & Inflammat Lab, Cambridge CB2 0XY, England
[2] Univ Cambridge, Dept Med Genet, Addenbrookes Hosp, Cambridge Inst Med Res, Cambridge CB2 0XY, England
Funding
Wellcome Trust (UK);
Keywords
Bayesian analysis; Data confidentiality; Statistical genetics; GENOME-WIDE ASSOCIATION; SELECTION; LASSO;
DOI
10.1093/biostatistics/kxq035
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Homer and others (2008. Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays. PLoS Genetics 4, e1000167) recently showed that, given allele frequency data for a large number of single nucleotide polymorphisms in a sample together with corresponding population "reference" frequencies, by typing an individual's DNA sample at the same set of loci it can be inferred whether or not the individual was a member of the sample. This observation has been responsible for precautionary removal of large amounts of summary data from public access. This and further work on the problem has followed a frequentist approach. This paper sets out a Bayesian analysis of this problem which clarifies the role of the reference frequencies and allows incorporation of prior probabilities of the individual's membership in the sample.
Pages: 661-673
Number of pages: 13
Related papers
10 in total
  • [1] Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
    Burton, Paul R.
    Clayton, David G.
    Cardon, Lon R.
    Craddock, Nick
    Deloukas, Panos
    Duncanson, Audrey
    Kwiatkowski, Dominic P.
    McCarthy, Mark I.
    Ouwehand, Willem H.
    Samani, Nilesh J.
    Todd, John A.
    Donnelly, Peter
    Barrett, Jeffrey C.
    Davison, Dan
    Easton, Doug
    Evans, David
    Leung, Hin-Tak
    Marchini, Jonathan L.
    Morris, Andrew P.
    Spencer, Chris C. A.
    Tobin, Martin D.
    Attwood, Antony P.
    Boorman, James P.
    Cant, Barbara
    Everson, Ursula
    Hussey, Judith M.
    Jolley, Jennifer D.
    Knight, Alexandra S.
    Koch, Kerstin
    Meech, Elizabeth
    Nutland, Sarah
    Prowse, Christopher V.
    Stevens, Helen E.
    Taylor, Niall C.
    Walters, Graham R.
    Walker, Neil M.
    Watkins, Nicholas A.
    Winzer, Thilo
    Jones, Richard W.
    McArdle, Wendy L.
    Ring, Susan M.
    Strachan, David P.
    Pembrey, Marcus
    Breen, Gerome
    St Clair, David
    Caesar, Sian
    Gordon-Smith, Katherine
    Jones, Lisa
    Fraser, Christine
    Green, Elain K.
    [J]. NATURE, 2007, 447 (7145) : 661 - 678
  • [2] Least angle regression - Rejoinder
    Efron, B
    Hastie, T
    Johnstone, I
    Tibshirani, R
    [J]. ANNALS OF STATISTICS, 2004, 32 (02) : 494 - 499
  • [3] Sparse inverse covariance estimation with the graphical lasso
    Friedman, Jerome
    Hastie, Trevor
    Tibshirani, Robert
    [J]. BIOSTATISTICS, 2008, 9 (03) : 432 - 441
  • [4] The International HapMap Project
    Gibbs, RA
    Belmont, JW
    Hardenbol, P
    Willis, TD
    Yu, FL
    Yang, HM
    Ch'ang, LY
    Huang, W
    Liu, B
    Shen, Y
    Tam, PKH
    Tsui, LC
    Waye, MMY
    Wong, JTF
    Zeng, CQ
    Zhang, QR
    Chee, MS
    Galver, LM
    Kruglyak, S
    Murray, SS
    Oliphant, AR
    Montpetit, A
    Hudson, TJ
    Chagnon, F
    Ferretti, V
    Leboeuf, M
    Phillips, MS
    Verner, A
    Kwok, PY
    Duan, SH
    Lind, DL
    Miller, RD
    Rice, JP
    Saccone, NL
    Taillon-Miller, P
    Xiao, M
    Nakamura, Y
    Sekine, A
    Sorimachi, K
    Tanaka, T
    Tanaka, Y
    Tsunoda, T
    Yoshino, E
    Bentley, DR
    Deloukas, P
    Hunt, S
    Powell, D
    Altshuler, D
    Gabriel, SB
    Qiu, RZ
    [J]. NATURE, 2003, 426 (6968) : 789 - 796
  • [5] Investigation of the fine structure of European populations with applications to disease association studies
    Heath, Simon C.
    Gut, Ivo G.
    Brennan, Paul
    McKay, James D.
    Bencko, Vladimir
    Fabianova, Eleonora
    Foretova, Lenka
    Georges, Michel
    Janout, Vladimir
    Kabesch, Michael
    Krokan, Hans E.
    Elvestad, Maiken B.
    Lissowska, Jolanta
    Mates, Dana
    Rudnai, Peter
    Skorpen, Frank
    Schreiber, Stefan
    Soria, Jose M.
    Syvanen, Ann-Christine
    Meneton, Pierre
    Hercberg, Serge
    Galan, Pilar
    Szeszenia-Dabrowska, Neonilia
    Zaridze, David
    Genin, Emmanuel
    Cardon, Lon R.
    Lathrop, Mark
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2008, 16 (12) : 1413 - 1429
  • [6] Resolving Individuals Contributing Trace Amounts of DNA to Highly Complex Mixtures Using High-Density SNP Genotyping Microarrays
    Homer, Nils
    Szelinger, Szabolcs
    Redman, Margot
    Duggan, David
    Tembe, Waibhav
    Muehling, Jill
    Pearson, John V.
    Stephan, Dietrich A.
    Nelson, Stanley F.
    Craig, David W.
    [J]. PLOS GENETICS, 2008, 4 (08)
  • [7] A new statistic and its power to infer membership in a genome-wide association study using genotype frequencies
    Jacobs, Kevin B.
    Yeager, Meredith
    Wacholder, Sholom
    Craig, David
    Kraft, Peter
    Hunter, David J.
    Paschal, Justin
    Manolio, Teri A.
    Tucker, Margaret
    Hoover, Robert N.
    Thomas, Gilles D.
    Chanock, Stephen J.
    Chatterjee, Nilanjan
    [J]. NATURE GENETICS, 2009, 41 (11) : 1253 - U126
  • [8] High-dimensional graphs and variable selection with the Lasso
    Meinshausen, Nicolai
    Buehlmann, Peter
    [J]. ANNALS OF STATISTICS, 2006, 34 (03) : 1436 - 1462
  • [9] A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics
    Schäfer, J
    Strimmer, K
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2005, 4 : 1 - 30
  • [10] Model selection and estimation in the Gaussian graphical model
    Yuan, Ming
    Lin, Yi
    [J]. BIOMETRIKA, 2007, 94 (01) : 19 - 35