A simple Bayesian mixture model with a hybrid procedure for genome-wide association studies

被引:9
作者
Wei, Yu-Chung [1 ,2 ,3 ]
Wen, Shu-Hui [4 ]
Chen, Pei-Chun [1 ,2 ,5 ]
Wang, Chih-Hao [6 ]
Hsiao, Chuhsing K. [1 ,2 ,5 ]
机构
[1] Natl Taiwan Univ, Dept Publ Hlth, Inst Epidemiol, Taipei 100, Taiwan
[2] Natl Taiwan Univ, Res Ctr Gene Environm & Human Hlth, Taipei 100, Taiwan
[3] Natl Chiao Tung Univ, Inst Stat, Hsinchu, Taiwan
[4] Tzu Chi Univ, Coll Med, Dept Publ Hlth, Hualien, Taiwan
[5] Natl Taiwan Univ, Res Ctr Med Excellence, Taipei 100, Taiwan
[6] Fu Jen Catholic Univ, Coll Med, Cardinal Tien Hosp, Dept Cardiol, Taipei, Taiwan
基金
英国惠康基金;
关键词
Bayesian inference; GWAS; mixture model; WTCCC; FALSE DISCOVERY; RHEUMATOID-ARTHRITIS; POSITIVE REPORT; P-VALUES; PROBABILITY; GENE; EPIDEMIOLOGY; MICROARRAY; SCAN;
D O I
10.1038/ejhg.2010.51
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genome-wide association studies often face the undesirable result of either failing to detect any influential markers at all because of a stringent level for testing error corrections or encountering difficulty in quantifying the importance of markers by their P-values. Advocates of estimation procedures prefer to estimate the proportion of association rather than test significance to avoid overinterpretation. Here, we adopt a Bayesian hierarchical mixture model to estimate directly the proportion of influential markers, and then proceed to a selection procedure based on the Bayes factor (BF). This mixture model is able to accommodate different sources of dependence in the data through only a few parameters. Specifically, we focus on a standardized risk measure of unit variance so that fewer parameters are involved in inference. The expected value of this measure follows a mixture distribution with a mixing probability of association, and it is robust to minor allele frequencies. Furthermore, to select promising markers, we use the magnitude of the BF to represent the strength of evidence in support of the association between markers and disease. We demonstrate this procedure both with simulations and with SNP data from studies on rheumatoid arthritis, coronary artery disease, and Crohn's disease obtained from the Wellcome Trust Case-Control Consortium. This Bayesian procedure outperforms other existing methods in terms of accuracy, power, and computational efficiency. The R code that implements this method is available at http://homepage.ntu.edu.tw/similar to ckhsiao/Bmix/Bmix.htm. European Journal of Human Genetics (2010) 18, 942-947; doi:10.1038/ejhg.2010.51; published online 21 April 2010
引用
收藏
页码:942 / 947
页数:6
相关论文
共 23 条
  • [1] The Bayesian revolution in genetics
    Beaumont, MA
    Rannala, B
    [J]. NATURE REVIEWS GENETICS, 2004, 5 (04) : 251 - 261
  • [2] A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis
    Begovich, AB
    Carlton, VEH
    Honigberg, LA
    Schrodi, SJ
    Chokkalingam, AP
    Alexander, HC
    Ardlie, KG
    Huang, QQ
    Smith, AM
    Spoerke, JM
    Conn, MT
    Chang, M
    Chang, SYP
    Saiki, RK
    Catanese, JJ
    Leong, DU
    Garcia, VE
    McAllister, LB
    Jeffery, DA
    Lee, AT
    Batliwalla, F
    Remmers, E
    Criswell, LA
    Seldin, MF
    Kastner, DL
    Amos, CI
    Sninsky, JJ
    Gregersen, PK
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 75 (02) : 330 - 337
  • [3] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [4] Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
    Burton, Paul R.
    Clayton, David G.
    Cardon, Lon R.
    Craddock, Nick
    Deloukas, Panos
    Duncanson, Audrey
    Kwiatkowski, Dominic P.
    McCarthy, Mark I.
    Ouwehand, Willem H.
    Samani, Nilesh J.
    Todd, John A.
    Donnelly, Peter
    Barrett, Jeffrey C.
    Davison, Dan
    Easton, Doug
    Evans, David
    Leung, Hin-Tak
    Marchini, Jonathan L.
    Morris, Andrew P.
    Spencer, Chris C. A.
    Tobin, Martin D.
    Attwood, Antony P.
    Boorman, James P.
    Cant, Barbara
    Everson, Ursula
    Hussey, Judith M.
    Jolley, Jennifer D.
    Knight, Alexandra S.
    Koch, Kerstin
    Meech, Elizabeth
    Nutland, Sarah
    Prowse, Christopher V.
    Stevens, Helen E.
    Taylor, Niall C.
    Walters, Graham R.
    Walker, Neil M.
    Watkins, Nicholas A.
    Winzer, Thilo
    Jones, Richard W.
    McArdle, Wendy L.
    Ring, Susan M.
    Strachan, David P.
    Pembrey, Marcus
    Breen, Gerome
    St Clair, David
    Caesar, Sian
    Gordon-Smith, Katherine
    Jones, Lisa
    Fraser, Christine
    Green, Elain K.
    [J]. NATURE, 2007, 447 (7145) : 661 - 678
  • [5] False discovery control with p-value weighting
    Genovese, Christopher R.
    Roeder, Kathryn
    Wasserman, Larry
    [J]. BIOMETRIKA, 2006, 93 (03) : 509 - 524
  • [6] Investigation of genetic variation across the protein tyrosine phosphatase gene in patients with rheumatoid arthritis in the UK
    Hinks, Anne
    Eyre, Steve
    Barton, Anne
    Thomson, Wendy
    Worthington, Jane
    [J]. ANNALS OF THE RHEUMATIC DISEASES, 2007, 66 (05) : 683 - 686
  • [7] Variation analysis and gene annotation of eight MHC haplotypes: The MHC haplotype project
    Horton, Roger
    Gibson, Richard
    Coggill, Penny
    Miretti, Marcos
    Allcock, Richard J.
    Almeida, Jeff
    Forbes, Simon
    Gilbert, James G. R.
    Halls, Karen
    Harrow, Jennifer L.
    Hart, Elizabeth
    Howe, Kevin
    Jackson, David K.
    Palmer, Sophie
    Roberts, Anne N.
    Sims, Sarah
    Stewart, C. Andrew
    Traherne, James A.
    Trevanion, Steve
    Wilming, Laurens
    Rogers, Jane
    de Jong, Pieter J.
    Elliott, John F.
    Sawcer, Stephen
    Todd, John A.
    Trowsdale, John
    Beck, Stephan
    [J]. IMMUNOGENETICS, 2008, 60 (01) : 1 - 18
  • [8] HUNG RJ, 2007, CANCER EPIDEM BIOMAR, V81, P397
  • [9] BAYES FACTORS
    KASS, RE
    RAFTERY, AE
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (430) : 773 - 795
  • [10] A genome scan for loci influencing anti-atherogenic serum bilirubin levels
    Kronenberg, F
    Coon, H
    Gutin, A
    Abkevich, V
    Samuels, ME
    Ballinger, DG
    Hopkins, PN
    Hunt, SC
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2002, 10 (09) : 539 - 546