A Comparison Study of Fixed and Mixed Effect Models for Gene Level Association Studies of Complex Traits

被引:9
作者
Fan, Ruzong [1 ]
Chiu, Chi-yang [1 ]
Jung, Jeesun [2 ]
Weeks, Daniel E. [3 ,4 ]
Wilson, Alexander F. [5 ]
Bailey-Wilson, Joan E. [5 ]
Amos, Christopher I. [6 ]
Chen, Zhen [1 ]
Mills, James L. [7 ]
Xiong, Momiao [8 ]
机构
[1] Eunice Kennedy Shriver Natl Inst Child Hlth & Hum, Biostat & Bioinformat Branch, Div Intramural Populat Hlth Res, NIH, Bethesda, MD 20892 USA
[2] NIAAA, Lab Epidemiol & Biometry, NIH, Bethesda, MD USA
[3] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Human Genet, Pittsburgh, PA 15261 USA
[4] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Biostat, Pittsburgh, PA 15261 USA
[5] NHGRI, Computat & Stat Genom Branch, NIH, Bethesda, MD 20892 USA
[6] Geisel Sch Med Dartmouth, Dept Biomed Data Sci, Lebanon, NH USA
[7] Eunice Kennedy Shriver Natl Inst Child Hlth & Hum, Epidemiol Branch, Div Intramural Populat Hlth Res, NIH, Bethesda, MD 20892 USA
[8] Univ Texas Houston, Ctr Human Genet, Houston, TX USA
关键词
rare variants; common variants; association mapping; quantitative/dichotomous trait loci; complex traits; functional data analysis; multivariate linear models; logistic regressions; FUNCTIONAL LINEAR-MODELS; QUANTITATIVE TRAITS; RARE; LOCI; METAANALYSIS; VARIANTS; DISEASES; LINKAGE;
D O I
10.1002/gepi.21984
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
In association studies of complex traits, fixed-effect regression models are usually used to test for association between traits and major gene loci. In recent years, variance-component tests based on mixed models were developed for region-based genetic variant association tests. In the mixed models, the association is tested by a null hypothesis of zero variance via a sequence kernel association test (SKAT), its optimal unified test (SKAT-O), and a combined sum test of rare and common variant effect (SKAT-C). Although there are some comparison studies to evaluate the performance of mixed and fixed models, there is no systematic analysis to determine when the mixed models perform better and when the fixed models perform better. Here we evaluated, based on extensive simulations, the performance of the fixed and mixed model statistics, using genetic variants located in 3, 6, 9, 12, and 15 kb simulated regions. We compared the performance of three models: (i) mixed models that lead to SKAT, SKAT-O, and SKAT-C, (ii) traditional fixed-effect additive models, and (iii) fixed-effect functional regression models. To evaluate the type I error rates of the tests of fixed models, we generated genotype data by two methods: (i) using all variants, (ii) using only rare variants. We found that the fixed-effect tests accurately control or have low false positive rates. We performed simulation analyses to compare power for two scenarios: (i) all causal variants are rare, (ii) some causal variants are rare and some are common. Either one or both of the fixed-effect models performed better than or similar to the mixed models except when (1) the region sizes are 12 and 15 kb and (2) effect sizes are small. Therefore, the assumption of mixed models could be satisfied and SKAT/SKAT-O/SKAT-C could perform better if the number of causal variants is large and each causal variant contributes a small amount to the traits (i.e., polygenes). In major gene association studies, we argue that the fixed-effect models perform better or similarly to mixed models in most cases because some variants should affect the traits relatively large. In practice, it makes sense to perform analysis by both the fixed and mixed effect models and to make a comparison, and this can be readily done using our R codes and the SKAT packages. Published 2016 Wiley Periodicals, Inc.
引用
收藏
页码:702 / 721
页数:20
相关论文
共 32 条
[1]  
[Anonymous], 1984, An introduction to multivariate statistical analysis
[2]  
[Anonymous], PHILOS T ROYAL SOC E
[3]   Sequence Kernel Association Test for Survival Traits [J].
Chen, Han ;
Lumley, Thomas ;
Brody, Jennifer ;
Heard-Costa, Nancy L. ;
Fox, Caroline S. ;
Cupples, L. Adrienne ;
Dupuis, Josee .
GENETIC EPIDEMIOLOGY, 2014, 38 (03) :191-197
[4]   A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data:: Application to HLA in type 1 diabetes [J].
Cordell, HJ ;
Clayton, DG .
AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 70 (01) :124-141
[5]   Meta-analysis of Complex Diseases at Gene Level with Generalized Functional Linear Models [J].
Fan, Ruzong ;
Wang, Yifan ;
Chiu, Chi-yang ;
Chen, Wei ;
Ren, Haobo ;
Li, Yun ;
Boehnke, Michael ;
Amos, Christopher I. ;
Moore, Jason H. ;
Xiong, Momiao .
GENETICS, 2016, 202 (02) :457-+
[6]   Gene-Based Association Analysis for Censored Traits Via Fixed Effect Functional Regressions [J].
Fan, Ruzong ;
Wang, Yifan ;
Yan, Qi ;
Ding, Ying ;
Weeks, Daniel E. ;
Lu, Zhaohui ;
Ren, Haobo ;
Cook, Richard J. ;
Xiong, Momiao ;
Swaroop, Anand ;
Chew, Emily Y. ;
Chen, Wei .
GENETIC EPIDEMIOLOGY, 2016, 40 (02) :133-143
[7]   Gene Level Meta-Analysis of Quantitative Traits by Functional Linear Models [J].
Fan, Ruzong ;
Wang, Yifan ;
Boehnke, Michael ;
Chen, Wei ;
Li, Yun ;
Ren, Haobo ;
Lobach, Iryna ;
Xiong, Momiao .
GENETICS, 2015, 200 (04) :1089-+
[8]   Generalized Functional Linear Models for Gene-Based Case-Control Association Studies [J].
Fan, Ruzong ;
Wang, Yifan ;
Mills, James L. ;
Carter, Tonia C. ;
Lobach, Iryna ;
Wilson, Alexander F. ;
Bailey-Wilson, Joan E. ;
Weeks, Daniel E. ;
Xiong, Momiao .
GENETIC EPIDEMIOLOGY, 2014, 38 (07) :622-637
[9]   Functional Linear Models for Association Analysis of Quantitative Traits [J].
Fan, Ruzong ;
Wang, Yifan ;
Mills, James L. ;
Wilson, Alexander F. ;
Bailey-Wilson, Joan E. ;
Xiong, Momiao .
GENETIC EPIDEMIOLOGY, 2013, 37 (07) :726-742
[10]   High resolution association mapping of quantitative trait loci: A population-based approach [J].
Fan, RZ ;
Jung, JS ;
Jin, L .
GENETICS, 2006, 172 (01) :663-686