Revisiting the genome-wide significance threshold for common variant GWAS

被引:93
作者
Chen, Zhongsheng
Boehnke, Michael [1 ,2 ]
Wen, Xiaoquan
Mukherjee, Bhramar
机构
[1] Univ Michigan, Sch Publ Hlth, Dept Biostat, 1415 Washington Hts, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Sch Publ Hlth, Ctr Stat Genet, 1415 Washington Hts, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
multiple testing; FDR; family-wise error rate; Bonferroni correction; Benjamini-Hochberg; Bayesian false discovery probability; FALSE DISCOVERY RATE; ASSOCIATION; LOCI; METAANALYSIS; HAPLOTYPES; EQTL;
D O I
10.1093/g3journal/jkaa056
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Over the last decade, GWAS meta-analyses have used a strict P-value threshold of 5 x 10(-8) to classify associations as significant. Here, we use our current understanding of frequently studied traits including lipid levels, height, and BMI to revisit this genome-wide significance threshold. We compare the performance of studies using the P = 5 x 10(-8) threshold in terms of true and false positive rate to other multiple testing strategies: (1) less stringent P-value thresholds, (2) controlling the FDR with the Benjamini-Hochberg and Benjamini-Yekutieli procedure, and (3) controlling the Bayesian FDR with posterior probabilities. We applied these procedures to re-analyze results from the Global Lipids and GIANT GWAS meta-analysis consortia and supported them with extensive simulation that mimics the empirical data. We observe in simulated studies with sample sizes similar to 20,000 and >120,000 that relaxing the P-value threshold to 5 x 10(-7) increased discovery at the cost of 18% and 8% of additional loci being false positive results, respectively. FDR and Bayesian FDR are well controlled for both sample sizes with a few exceptions that disappear under a less stringent definition of true positives and the two approaches yield similar results. Our work quantifies the value of using a relaxed P-value threshold in large studies to increase their true positive discovery but also show the excess false positive rates due to such actions in modest-sized studies. These results may guide investigators considering different thresholds in replication studies and downstream work such as gene-set enrichment or pathway analysis. Finally, we demonstrate the viability of FDR-controlling procedures in GWAS.
引用
收藏
页数:12
相关论文
共 45 条
[31]   The effect of correlation in false discovery rate estimation [J].
Schwartzman, Armin ;
Lin, Xihong .
BIOMETRIKA, 2011, 98 (01) :199-214
[32]   Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index [J].
Speliotes, Elizabeth K. ;
Willer, Cristen J. ;
Berndt, Sonja I. ;
Monda, Keri L. ;
Thorleifsson, Gudmar ;
Jackson, Anne U. ;
Allen, Hana Lango ;
Lindgren, Cecilia M. ;
Luan, Jian'an ;
Maegi, Reedik ;
Randall, Joshua C. ;
Vedantam, Sailaja ;
Winkler, Thomas W. ;
Qi, Lu ;
Workalemahu, Tsegaselassie ;
Heid, Iris M. ;
Steinthorsdottir, Valgerdur ;
Stringham, Heather M. ;
Weedon, Michael N. ;
Wheeler, Eleanor ;
Wood, Andrew R. ;
Ferreira, Teresa ;
Weyant, Robert J. ;
Segre, Ayellet V. ;
Estrada, Karol ;
Liang, Liming ;
Nemesh, James ;
Park, Ju-Hyun ;
Gustafsson, Stefan ;
Kilpelaenen, Tuomas O. ;
Yang, Jian ;
Bouatia-Naji, Nabila ;
Esko, Tonu ;
Feitosa, Mary F. ;
Kutalik, Zoltan ;
Mangino, Massimo ;
Raychaudhuri, Soumya ;
Scherag, Andre ;
Smith, Albert Vernon ;
Welch, Ryan ;
Zhao, Jing Hua ;
Aben, Katja K. ;
Absher, Devin M. ;
Amin, Najaf ;
Dixon, Anna L. ;
Fisher, Eva ;
Glazer, Nicole L. ;
Goddard, Michael E. ;
Heard-Costa, Nancy L. ;
Hoesel, Volker .
NATURE GENETICS, 2010, 42 (11) :937-U53
[33]   Nonparametric Bayesian estimation of positive false discovery rates [J].
Tang, Yongqiang ;
Ghosal, Subhashis ;
Roy, Anindya .
BIOMETRICS, 2007, 63 (04) :1126-1134
[34]   Biological, clinical and population relevance of 95 loci for blood lipids [J].
Teslovich, Tanya M. ;
Musunuru, Kiran ;
Smith, Albert V. ;
Edmondson, Andrew C. ;
Stylianou, Ioannis M. ;
Koseki, Masahiro ;
Pirruccello, James P. ;
Ripatti, Samuli ;
Chasman, Daniel I. ;
Willer, Cristen J. ;
Johansen, Christopher T. ;
Fouchier, Sigrid W. ;
Isaacs, Aaron ;
Peloso, Gina M. ;
Barbalic, Maja ;
Ricketts, Sally L. ;
Bis, Joshua C. ;
Aulchenko, Yurii S. ;
Thorleifsson, Gudmar ;
Feitosa, Mary F. ;
Chambers, John ;
Orho-Melander, Marju ;
Melander, Olle ;
Johnson, Toby ;
Li, Xiaohui ;
Guo, Xiuqing ;
Li, Mingyao ;
Cho, Yoon Shin ;
Go, Min Jin ;
Kim, Young Jin ;
Lee, Jong-Young ;
Park, Taesung ;
Kim, Kyunga ;
Sim, Xueling ;
Ong, Rick Twee-Hee ;
Croteau-Chonka, Damien C. ;
Lange, Leslie A. ;
Smith, Joshua D. ;
Song, Kijoung ;
Zhao, Jing Hua ;
Yuan, Xin ;
Luan, Jian'an ;
Lamina, Claudia ;
Ziegler, Andreas ;
Zhang, Weihua ;
Zee, Robert Y. L. ;
Wright, Alan F. ;
Witteman, Jacqueline C. M. ;
Wilson, James F. ;
Willemsen, Gonneke .
NATURE, 2010, 466 (7307) :707-713
[35]   A Bayesian measure of the probability of false discovery in genetic epidemiology studies [J].
Wakefield, Jon .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (02) :208-227
[36]   Moving to a World Beyond "p<0.05" [J].
Wasserstein, Ronald L. ;
Schirm, Allen L. ;
Lazar, Nicole A. .
AMERICAN STATISTICIAN, 2019, 73 :1-19
[37]   Robust Bayesian FDR Control Using Bayes Factors, with Applications to Multi-tissue eQTL Discovery [J].
Wen X. .
Statistics in Biosciences, 2017, 9 (1) :28-49
[38]  
WILLER CJ, 2013, NAT GENET, V45, P1274, DOI DOI 10.1038/NG.2797
[39]   Newly identified loci that influence lipid concentrations and risk of coronary artery disease [J].
Willer, Cristen J. ;
Sanna, Serena ;
Jackson, Anne U. ;
Scuteri, Angelo ;
Bonnycastle, Lori L. ;
Clarke, Robert ;
Heath, Simon C. ;
Timpson, Nicholas J. ;
Najjar, Samer S. ;
Stringham, Heather M. ;
Strait, James ;
Duren, William L. ;
Maschio, Andrea ;
Busonero, Fabio ;
Mulas, Antonella ;
Albai, Giuseppe ;
Swift, Amy J. ;
Morken, Mario A. ;
Narisu, Narisu ;
Bennett, Derrick ;
Parish, Sarah ;
Shen, Haiqing ;
Galan, Pilar ;
Meneton, Pierre ;
Hercberg, Serge ;
Zelenika, Diana ;
Chen, Wei-Min ;
Li, Yun ;
Scott, Laura J. ;
Scheet, Paul A. ;
Sundvall, Jouko ;
Watanabe, Richard M. ;
Nagaraja, Ramaiah ;
Ebrahim, Shah ;
Lawlor, Debbie A. ;
Ben-Shlomo, Yoav ;
Davey-Smith, George ;
Shuldiner, Alan R. ;
Collins, Rory ;
Bergman, Richard N. ;
Uda, Manuela ;
Tuomilehto, Jaakko ;
Cao, Antonio ;
Collins, Francis S. ;
Lakatta, Edward ;
Lathrop, G. Mark ;
Boehnke, Michael ;
Schlessinger, David ;
Mohlke, Karen L. ;
Abecasis, Goncalo R. .
NATURE GENETICS, 2008, 40 (02) :161-169
[40]   METAL: fast and efficient meta-analysis of genomewide association scans [J].
Willer, Cristen J. ;
Li, Yun ;
Abecasis, Goncalo R. .
BIOINFORMATICS, 2010, 26 (17) :2190-2191