Functional Linear Models for Association Analysis of Quantitative Traits

被引:57
作者
Fan, Ruzong [1 ]
Wang, Yifan [1 ]
Mills, James L. [2 ]
Wilson, Alexander F. [3 ]
Bailey-Wilson, Joan E. [3 ]
Xiong, Momiao [4 ]
机构
[1] Eunice Kennedy Shriver Natl Inst Child Hlth & Hum, Biostat & Bioinformat Branch, Div Intramural Populat Hlth Res, NIH, Rockville, MD 20852 USA
[2] Eunice Kennedy Shriver Natl Inst Child Hlth & Hum, Epidemiol Branch, Div Intramural Populat Hlth Res, NIH, Rockville, MD 20852 USA
[3] NHGRI, Inherited Dis Res Branch, NIH, Bethesda, MD 20892 USA
[4] Univ Texas Houston, Human Genet Ctr, Houston, TX USA
基金
美国国家卫生研究院;
关键词
rare variants; common variants; association mapping; quantitative trait loci; complex traits; functional data analysis; RARE VARIANTS; SEQUENCING ASSOCIATION; COMMON DISEASES; REGRESSION; TESTS; SIMILARITY; STRATEGIES;
D O I
10.1002/gepi.21757
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Functional linear models are developed in this paper for testing associations between quantitative traits and genetic variants, which can be rare variants or common variants or the combination of the two. By treating multiple genetic variants of an individual in a human population as a realization of a stochastic process, the genome of an individual in a chromosome region is a continuum of sequence data rather than discrete observations. The genome of an individual is viewed as a stochastic function that contains both linkage and linkage disequilibrium (LD) information of the genetic markers. By using techniques of functional data analysis, both fixed and mixed effect functional linear models are built to test the association between quantitative traits and genetic variants adjusting for covariates. After extensive simulation analysis, it is shown that the F-distributed tests of the proposed fixed effect functional linear models have higher power than that of sequence kernel association test (SKAT) and its optimal unified test (SKAT-O) for three scenarios in most cases: (1) the causal variants are all rare, (2) the causal variants are both rare and common, and (3) the causal variants are common. The superior performance of the fixed effect functional linear models is most likely due to its optimal utilization of both genetic linkage and LD information of multiple genetic variants in a genome and similarity among different individuals, while SKAT and SKAT-O only model the similarities and pairwise LD but do not model linkage and higher order LD information sufficiently. In addition, the proposed fixed effect models generate accurate type I error rates in simulation studies. We also show that the functional kernel score tests of the proposed mixed effect functional linear models are preferable in candidate gene analysis and small sample problems. The methods are applied to analyze three biochemical traits in data from the Trinity Students Study.
引用
收藏
页码:726 / 742
页数:17
相关论文
共 49 条
[1]   Primer: Sequencing - the next generation [J].
不详 .
NATURE METHODS, 2008, 5 (01) :15-15
[2]  
[Anonymous], 2001, PRACTICAL GUIDE SPLI
[3]  
[Anonymous], 1996, Stochastic Processes
[4]   Statistical analysis strategies for association studies involving rare variants [J].
Bansal, Vikas ;
Libiger, Ondrej ;
Torkamani, Ali ;
Schork, Nicholas J. .
NATURE REVIEWS GENETICS, 2010, 11 (11) :773-785
[5]   Accurate detection and genotyping of SNPs utilizing population sequencing data [J].
Bansal, Vikas ;
Harismendy, Olivier ;
Tewhey, Ryan ;
Murray, Sarah S. ;
Schork, Nicholas J. ;
Topol, Eric J. ;
Frazer, Kelly A. .
GENOME RESEARCH, 2010, 20 (04) :537-545
[6]   Detecting Rare Variant Effects Using Extreme Phenotype Sampling in Sequencing Association Studies [J].
Barnett, Ian J. ;
Lee, Seunggeun ;
Lin, Xihong .
GENETIC EPIDEMIOLOGY, 2013, 37 (02) :142-151
[7]  
Clarke J, 2009, NAT NANOTECHNOL, V4, P265, DOI [10.1038/NNANO.2009.12, 10.1038/nnano.2009.12]
[8]  
Davies R. B., 1980, J ROYAL STAT SOC C, V29, P323, DOI DOI 10.2307/2346911
[9]   Computing the distribution of quadratic forms: Further comparisons between the Liu-Tang-Zhang approximation and exact methods [J].
Duchesne, Pierre ;
De Micheaux, Pierre Lafaye .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (04) :858-862
[10]  
Ferraty F.Y. Romain., 2010, OXFORD HDB FUNCTIONA