A computationally efficient clustering linear combination approach to jointly analyze multiple phenotypes for GWAS

被引:3
作者
Wang, Meida [1 ]
Zhang, Shuanglin [1 ]
Sha, Qiuying [1 ]
机构
[1] Michigan Technol Univ, Math Sci, Houghton, MI 49931 USA
基金
美国国家卫生研究院;
关键词
GENOME-WIDE ASSOCIATION; PRINCIPAL-COMPONENTS; GENETIC ASSOCIATION; TRAIT ANALYSIS; PLEIOTROPY; RARE; STATISTICS; POWER; HERITABILITY; VARIANTS;
D O I
10.1371/journal.pone.0260911
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
There has been an increasing interest in joint analysis of multiple phenotypes in genome-wide association studies (GWAS) because jointly analyzing multiple phenotypes may increase statistical power to detect genetic variants associated with complex diseases or traits. Recently, many statistical methods have been developed for joint analysis of multiple phenotypes in genetic association studies, including the Clustering Linear Combination (CLC) method. The CLC method works particularly well with phenotypes that have natural groupings, but due to the unknown number of clusters for a given data, the final test statistic of CLC method is the minimum p-value among all p-values of the CLC test statistics obtained from each possible number of clusters. Therefore, a simulation procedure needs to be used to evaluate the p-value of the final test statistic. This makes the CLC method computationally demanding. We develop a new method called computationally efficient CLC (ceCLC) to test the association between multiple phenotypes and a genetic variant. Instead of using the minimum p-value as the test statistic in the CLC method, ceCLC uses the Cauchy combination test to combine all p-values of the CLC test statistics obtained from each possible number of clusters. The test statistic of ceCLC approximately follows a standard Cauchy distribution, so the p-value can be obtained from the cumulative density function without the need for the simulation procedure. Through extensive simulation studies and application on the COPDGene data, the results demonstrate that the type I error rates of ceCLC are effectively controlled in different simulation settings and ceCLC either outperforms all other methods or has statistical power that is very close to the most powerful method with which it has been compared.
引用
收藏
页数:13
相关论文
共 54 条
[1]   Systemic effects of chronic obstructive pulmonary disease [J].
Agustí, AGN ;
Noguera, A ;
Sauleda, J ;
Sala, E ;
Pons, J ;
Busquets, X .
EUROPEAN RESPIRATORY JOURNAL, 2003, 21 (02) :347-360
[2]   Maximizing the Power of Principal-Component Analysis of Correlated Phenotypes in Genome-wide Association Studies [J].
Aschard, Hugues ;
Vilhjalmsson, Bjarni J. ;
Greliche, Nicolas ;
Morange, Pierre-Emmanuel ;
Tregouet, David-Alexandre ;
Kraft, Peter .
AMERICAN JOURNAL OF HUMAN GENETICS, 2014, 94 (05) :662-676
[3]   Chronic Obstructive Pulmonary Disease: Effects beyond the Lungs [J].
Barnes, Peter J. .
PLOS MEDICINE, 2010, 7 (03) :1-4
[4]   Linear mixed models and penalized least squares [J].
Bates, DM ;
DebRoy, S .
JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 91 (01) :1-17
[5]   Variants in FAM13A are associated with chronic obstructive pulmonary disease [J].
Cho, Michael H. ;
Boutaoui, Nadia ;
Klanderman, Barbara J. ;
Sylvia, Jody S. ;
Ziniti, John P. ;
Hersh, Craig P. ;
DeMeo, Dawn L. ;
Hunninghake, Gary M. ;
Litonjua, Augusto A. ;
Sparrow, David ;
Lange, Christoph ;
Won, Sungho ;
Murphy, James R. ;
Beaty, Terri H. ;
Regan, Elizabeth A. ;
Make, Barry J. ;
Hokanson, John E. ;
Crapo, James D. ;
Kong, Xiangyang ;
Anderson, Wayne H. ;
Tal-Singer, Ruth ;
Lomas, David A. ;
Bakke, Per ;
Gulsvik, Amund ;
Pillai, Sreekumar G. ;
Silverman, Edwin K. .
NATURE GENETICS, 2010, 42 (03) :200-202
[6]   Comparison of methods for multivariate gene-based association tests for complex diseases using common variants [J].
Chung, Jaeyoon ;
Jun, Gyungah R. ;
Dupuis, Josee ;
Farrer, Lindsay A. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 (05) :811-823
[7]   HOW THE POWER OF MANOVA CAN BOTH INCREASE AND DECREASE AS A FUNCTION OF THE INTERCORRELATIONS AMONG THE DEPENDENT-VARIABLES [J].
COLE, DA ;
MAXWELL, SE ;
ARVEY, R ;
SALAS, E .
PSYCHOLOGICAL BULLETIN, 1994, 115 (03) :465-474
[8]   Conditional analysis of multiple quantitative traits based on marginal GWAS summary statistics [J].
Deng, Yangqing ;
Pan, Wei .
GENETIC EPIDEMIOLOGY, 2017, 41 (05) :427-436
[9]   Genetic pleiotropy in complex traits and diseases: implications for genomic medicine [J].
Gratten, Jacob ;
Visscher, Peter M. .
GENOME MEDICINE, 2016, 8
[10]   The nature of small-airway obstruction in chronic obstructive pulmonary disease [J].
Hogg, JC ;
Chu, F ;
Utokaparch, S ;
Woods, R ;
Elliott, WM ;
Buzatu, L ;
Cherniack, RM ;
Rogers, RM ;
Sciurba, FC ;
Coxson, HO ;
Paré, PD .
NEW ENGLAND JOURNAL OF MEDICINE, 2004, 350 (26) :2645-2653