Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors

Cited by: 30
Authors
Chen, Wenhan [1, 2]
Wu, Yang [1 ]
Zheng, Zhili [1 ]
Qi, Ting [1, 3, 4]
Visscher, Peter M. [1 ]
Zhu, Zhihong [1 ]
Yang, Jian [1, 3, 4]
Affiliations
[1] Univ Queensland, Inst Mol Biosci, Brisbane, Qld 4072, Australia
[2] Garvan Inst Med Res, Genom & Epigenet Theme, Epigenet Res Lab, Sydney, NSW 2010, Australia
[3] Westlake Univ, Sch Life Sci, Hangzhou 310024, Zhejiang, Peoples R China
[4] Westlake Lab Life Sci & Biomed, Hangzhou 310024, Zhejiang, Peoples R China
Funding
Australian Research Council; UK Medical Research Council
Keywords
GENOME-WIDE ASSOCIATION; GENETIC ARCHITECTURE; SUSCEPTIBILITY LOCI; CAUSAL VARIANTS; COMPLEX TRAITS; RISK; IMPUTATION; IDENTIFICATION; HERITABILITY; REGRESSION;
DOI
10.1038/s41467-021-27438-7
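Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification code
07; 0710; 09
Abstract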
Analyses of summary statistics from GWAS are subject to biases caused by errors in the discovery GWAS or the linkage disequilibrium reference data set, or by heterogeneity between data sets. Here, the authors propose a quality control method, to be added to analyses of GWAS summary data, that can reduce such biases.
Summary statistics from genome-wide association studies (GWAS) have facilitated the development of various summary-data-based methods, which typically require a reference sample for linkage disequilibrium (LD) estimation. Analyses using these methods may be biased by errors in the GWAS summary data or the LD reference, or by heterogeneity between the GWAS and the LD reference. Here we propose a quality control method, DENTIST, that leverages LD among genetic variants to detect and eliminate errors in the GWAS or the LD reference, as well as heterogeneity between the two. Through simulations, we demonstrate that DENTIST substantially reduces the false-positive rate in detecting secondary signals in summary-data-based conditional and joint association analysis, especially for imputed rare variants (false-positive rate reduced from >28% to <2% in the presence of heterogeneity between GWAS and LD reference). We further show that DENTIST can improve other summary-data-based analyses such as fine-mapping analysis.
Pages: 10