A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data

被引:9
作者
Lin, Nan [1 ]
Zhu, Yun [2 ]
Fan, Ruzong [3 ]
Xiong, Momiao [1 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Dept Biostat & Data Sci, Sch Publ Hlth, Houston, TX 77030 USA
[2] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Epidemiol, New Orleans, LA USA
[3] Eunice Kennedy Shriver Natl Inst Child Hlth & Hum, Biostat & Bioinformat Branch, Div Intramural Populat Hlth Res, NIH, Bethesda, MD USA
基金
美国国家卫生研究院;
关键词
GENOME-WIDE ASSOCIATION; PRINCIPAL-COMPONENT ANALYSIS; LINEAR MIXED-MODEL; CONGENITAL HEART-DISEASE; GENOTYPE-PHENOTYPE MAP; C-REACTIVE PROTEIN; MULTIPLE PHENOTYPES; MULTIVARIATE PHENOTYPES; COMPLEX ORGANISMS; TRAIT ASSOCIATION;
D O I
10.1371/journal.pcbi.1005788
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information to achieve deep understanding of the complex genetic structures of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the current multiple phenotype association analysis paradigm lacks breadth (number of phenotypes and genetic variants jointly analyzed at the same time) and depth (hierarchical structure of phenotype and genotypes). A key issue for high dimensional pleiotropic analysis is to effectively extract informative internal representation and features from high dimensional genotype and phenotype data. To explore correlation information of genetic variants, effectively reduce data dimensions, and overcome critical barriers in advancing the development of novel statistical methods and computational algorithms for genetic pleiotropic analysis, we proposed a new statistic method referred to as a quadratically regularized functional CCA (QRFCCA) for association analysis which combines three approaches: (1) quadratically regularized matrix factorization, (2) functional data analysis and (3) canonical correlation analysis (CCA). Large-scale simulations show that the QRFCCA has a much higher power than that of the ten competing statistics while retaining the appropriate type 1 errors. To further evaluate performance, the QRFCCA and ten other statistics are applied to the whole genome sequencing dataset from the TwinsUK study. We identify a total of 79 genes with rare variants and 67 genes with common variants significantly associated with the 46 traits using QRFCCA. The results show that the QRFCCA substantially outperforms the ten other statistics.
引用
收藏
页数:33
相关论文
共 83 条
[1]   Therapeutic and protective effects of Caesalpinia gilliesii and Cajanus cajan proteins against acetaminophen overdose-induced renal damage [J].
Aly, Hanan F. ;
Rizk, Maha Z. ;
Abo-Elmatty, Dina M. ;
Desoky, M. M. ;
Ibrahim, N. A. ;
Younis, Eman A. .
TOXICOLOGY AND INDUSTRIAL HEALTH, 2016, 32 (04) :753-768
[2]  
Anderson T. W., 1962, INTRO MULTIVARIATE S
[3]  
[Anonymous], 2017, CATALOG PUBLISHED GE
[4]   Maximizing the Power of Principal-Component Analysis of Correlated Phenotypes in Genome-wide Association Studies [J].
Aschard, Hugues ;
Vilhjalmsson, Bjarni J. ;
Greliche, Nicolas ;
Morange, Pierre-Emmanuel ;
Tregouet, David-Alexandre ;
Kraft, Peter .
AMERICAN JOURNAL OF HUMAN GENETICS, 2014, 94 (05) :662-676
[5]   Gender differences in Nociceptin/Orphanin FQ-induced food intake in strains derived from rats prone (WOKW) and resistant (Dark Agouti) to metabolic syndrome: a possible involvement of the cocaine- and amphetamine-regulated transcript system [J].
Battistoni, Serena ;
Kloting, Ingrid ;
Cifani, Carlo ;
Massi, Maurizio ;
Polidori, Carlo .
GENES AND NUTRITION, 2011, 6 (02) :197-202
[6]   Rank-Based Inverse Normal Transformations are Increasingly Used, But are They Merited? [J].
Beasley, T. Mark ;
Erickson, Stephen ;
Allison, David B. .
BEHAVIOR GENETICS, 2009, 39 (05) :580-595
[7]   Pleiotropic effects of statins: new therapeutic targets in drug design [J].
Bedi, Onkar ;
Dhawan, Veena ;
Sharma, P. L. ;
Kumar, Puneet .
NAUNYN-SCHMIEDEBERGS ARCHIVES OF PHARMACOLOGY, 2016, 389 (07) :695-712
[8]   A Statistical Approach for Testing Cross-Phenotype Effects of Rare Variants [J].
Broadaway, K. Alaine ;
Cutler, David J. ;
Duncan, Richard ;
Moore, Jacob L. ;
Ware, Erin B. ;
Jhun, Min A. ;
Bielak, Lawrence F. ;
Zhao, Wei ;
Smith, Jennifer A. ;
Peyser, Patricia A. ;
Kardia, Sharon L. R. ;
Ghosh, Debashis ;
Epstein, Michael P. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2016, 98 (03) :525-540
[9]   An atlas of genetic correlations across human diseases and traits [J].
Bulik-Sullivan, Brendan ;
Finucane, Hilary K. ;
Anttila, Verneri ;
Gusev, Alexander ;
Day, Felix R. ;
Loh, Po-Ru ;
Duncan, Laramie ;
Perry, John R. B. ;
Patterson, Nick ;
Robinson, Elise B. ;
Daly, Mark J. ;
Price, Alkes L. ;
Neale, Benjamin M. .
NATURE GENETICS, 2015, 47 (11) :1236-+
[10]  
Chen W, 2015, BIOMED RES INT, V2015