A clustering linear combination method for multiple phenotype association studies based on GWAS summary statistics

被引:2
作者
Wang, Meida [1 ]
Cao, Xuewei [1 ]
Zhang, Shuanglin [1 ]
Sha, Qiuying [1 ]
机构
[1] Michigan Technol Univ, Math Sci, Houghton, MI 49931 USA
关键词
TRAIT ANALYSIS; GENE; CLASSIFICATION; DISEASES; TESTS; MODEL; RARE;
D O I
10.1038/s41598-023-30415-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
There is strong evidence showing that joint analysis of multiple phenotypes in genome-wide association studies (GWAS) can increase statistical power when detecting the association between genetic variants and human complex diseases. We previously developed the Clustering Linear Combination (CLC) method and a computationally efficient CLC (ceCLC) method to test the association between multiple phenotypes and a genetic variant, which perform very well. However, both of these methods require individual-level genotypes and phenotypes that are often not easily accessible. In this research, we develop a novel method called sCLC for association studies of multiple phenotypes and a genetic variant based on GWAS summary statistics. We use the LD score regression to estimate the correlation matrix among phenotypes. The test statistic of sCLC is constructed by GWAS summary statistics and has an approximate Cauchy distribution. We perform a variety of simulation studies and compare sCLC with other commonly used methods for multiple phenotype association studies using GWAS summary statistics. Simulation results show that sCLC can control Type I error rates well and has the highest power in most scenarios. Moreover, we apply the newly developed method to the UK Biobank GWAS summary statistics from the XIII category with 70 related musculoskeletal system and connective tissue phenotypes. The results demonstrate that sCLC detects the most number of significant SNPs, and most of these identified SNPs can be matched to genes that have been reported in the GWAS catalog to be associated with those phenotypes. Furthermore, sCLC also identifies some novel signals that were missed by standard GWAS, which provide new insight into the potential genetic factors of the musculoskeletal system and connective tissue phenotypes.
引用
收藏
页数:10
相关论文
共 47 条
  • [31] Dissecting the genetics of complex traits using summary association statistics
    Pasaniuc, Bogdan
    Price, Alkes L.
    [J]. NATURE REVIEWS GENETICS, 2017, 18 (02) : 117 - 127
  • [32] Investigation of multi-trait associations using pathway-based analysis of GWAS summary statistics
    Pei, Guangsheng
    Sun, Hua
    Dai, Yulin
    Liu, Xiaoming
    Zhao, Zhongming
    Jia, Peilin
    [J]. BMC GENOMICS, 2019, 20 (Suppl 1)
  • [33] Methods for Analyzing Multivariate Phenotypes in Genetic Association Studies
    Qiong Yang
    Yuanjia Wang
    [J]. JOURNAL OF PROBABILITY AND STATISTICS, 2012, 2012
  • [34] Multivariate generalized linear model for genetic pleiotropy
    Schaid, Daniel J.
    Tong, Xingwei
    Batzler, Anthony
    Sinnwell, Jason P.
    Qing, Jiang
    Biernacka, Joanna M.
    [J]. BIOSTATISTICS, 2019, 20 (01) : 111 - 128
  • [35] A clustering linear combination approach to jointly analyze multiple phenotypes for GWAS
    Sha, Qiuying
    Wang, Zhenchuan
    Zhang, Xiao
    Zhang, Shuanglin
    [J]. BIOINFORMATICS, 2019, 35 (08) : 1373 - 1379
  • [36] Joint Analysis for Genome-Wide Association Studies in Family-Based Designs
    Sha, Qiuying
    Zhang, Zhaogong
    Zhang, Shuanglin
    [J]. PLOS ONE, 2011, 6 (07):
  • [37] A Unified Framework for Association Analysis with Multiple Related Phenotypes
    Stephens, Matthew
    [J]. PLOS ONE, 2013, 8 (07):
  • [38] UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age
    Sudlow, Cathie
    Gallacher, John
    Allen, Naomi
    Beral, Valerie
    Burton, Paul
    Danesh, John
    Downey, Paul
    Elliott, Paul
    Green, Jane
    Landray, Martin
    Liu, Bette
    Matthews, Paul
    Ong, Giok
    Pell, Jill
    Silman, Alan
    Young, Alan
    Sprosen, Tim
    Peakman, Tim
    Collins, Rory
    [J]. PLOS MEDICINE, 2015, 12 (03)
  • [39] A Novel Framework for Analysis of the Shared Genetic Background of Correlated Traits
    Svishcheva, Gulnara R.
    Tiys, Evgeny S.
    Elgaeva, Elizaveta E.
    Feoktistova, Sofia G.
    Timmers, Paul R. H. J.
    Sharapov, Sodbo Zh
    Axenovich, Tatiana, I
    Tsepilov, Yakov A.
    [J]. GENES, 2022, 13 (10)
  • [40] Multi-trait analysis of genome-wide association summary statistics using MTAG
    Turley, Patrick
    Walters, Raymond K.
    Maghzian, Omeed
    Okbay, Aysu
    Lee, James J.
    Fontana, Mark Alan
    Tuan Anh Nguyen-Viet
    Wedow, Robbee
    Zacher, Meghan
    Furlotte, Nicholas A.
    Magnusson, Patrik
    Oskarsson, Sven
    Johannesson, Magnus
    Visscher, Peter M.
    Laibson, David
    Cesarini, David
    Neale, Benjamin M.
    Benjamin, Daniel J.
    [J]. NATURE GENETICS, 2018, 50 (02) : 229 - +