The Value of Statistical or Bioinformatics Annotation for Rare Variant Association With Quantitative Trait

被引:14
作者
Byrnes, Andrea E. [1 ]
Wu, Michael C. [1 ]
Wright, Fred A. [1 ]
Li, Mingyao [2 ]
Li, Yun [1 ,3 ,4 ]
机构
[1] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
[2] Univ Penn, Sch Med, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
[3] Univ N Carolina, Dept Genet, Chapel Hill, NC 27599 USA
[4] Univ N Carolina, Dept Comp Sci, Chapel Hill, NC 27599 USA
关键词
rare variants; association; weighting; variable selection; variant annotation; PENALIZED REGRESSION; MISSING HERITABILITY; GENOTYPE IMPUTATION; AFRICAN-AMERICANS; COMMON VARIANTS; SEQUENCING DATA; LEAST ANGLE; HAPLOTYPES; STRATEGIES; SELECTION;
D O I
10.1002/gepi.21747
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
In the past few years, a plethora of methods for rare variant association with phenotype have been proposed. These methods aggregate information from multiple rare variants across genomic region(s), but there is little consensus as to which method is most effective. The weighting scheme adopted when aggregating information across variants is one of the primary determinants of effectiveness. Here we present a systematic evaluation of multiple weighting schemes through a series of simulations intended to mimic large sequencing studies of a quantitative trait. We evaluate existing phenotype-independent and phenotype-dependent methods, as well as weights estimated by penalized regression approaches including Lasso, Elastic Net, and SCAD. We find that the difference in power between phenotype-dependent schemes is negligible when high-quality functional annotations are available. When functional annotations are unavailable or incomplete, all methods suffer from power loss; however, the variable selection methods outperform the others at the cost of increased computational time. Therefore, in the absence of good annotation, we recommend variable selection methods (which can be viewed as statistical annotation) on top of regions implicated by a phenotype-independent weighting scheme. Further, once a region is implicated, variable selection can help to identify potential causal single nucleotide polymorphisms for biological validation. These findings are supported by an analysis of a high coverage targeted sequencing study of 1,898 individuals.
引用
收藏
页码:666 / 674
页数:9
相关论文
共 52 条
  • [1] An integrated map of genetic variation from 1,092 human genomes
    Altshuler, David M.
    Durbin, Richard M.
    Abecasis, Goncalo R.
    Bentley, David R.
    Chakravarti, Aravinda
    Clark, Andrew G.
    Donnelly, Peter
    Eichler, Evan E.
    Flicek, Paul
    Gabriel, Stacey B.
    Gibbs, Richard A.
    Green, Eric D.
    Hurles, Matthew E.
    Knoppers, Bartha M.
    Korbel, Jan O.
    Lander, Eric S.
    Lee, Charles
    Lehrach, Hans
    Mardis, Elaine R.
    Marth, Gabor T.
    McVean, Gil A.
    Nickerson, Deborah A.
    Schmidt, Jeanette P.
    Sherry, Stephen T.
    Wang, Jun
    Wilson, Richard K.
    Gibbs, Richard A.
    Dinh, Huyen
    Kovar, Christie
    Lee, Sandra
    Lewis, Lora
    Muzny, Donna
    Reid, Jeff
    Wang, Min
    Wang, Jun
    Fang, Xiaodong
    Guo, Xiaosen
    Jian, Min
    Jiang, Hui
    Jin, Xin
    Li, Guoqing
    Li, Jingxiang
    Li, Yingrui
    Li, Zhuo
    Liu, Xiao
    Lu, Yao
    Ma, Xuedi
    Su, Zhe
    Tai, Shuaishuai
    Tang, Meifang
    [J]. NATURE, 2012, 491 (7422) : 56 - 65
  • [2] Imputation of Exome Sequence Variants into Population-Based Samples and Blood-Cell-Trait-Associated Loci in African Americans: NHLBI GO Exome Sequencing Project
    Auer, Paul L.
    Johnsen, Jill M.
    Johnson, Andrew D.
    Logsdon, Benjamin A.
    Lange, Leslie A.
    Nalls, Michael A.
    Zhang, Guosheng
    Franceschini, Nora
    Fox, Keolu
    Lange, Ethan M.
    Rich, Stephen S.
    O'Donnell, Christopher J.
    Jackson, Rebecca D.
    Wallace, Robert B.
    Chen, Zhao
    Graubert, Timothy A.
    Wilson, James G.
    Tang, Hua
    Lettre, Guillaume
    Reiner, Alex P.
    Ganesh, Santhi K.
    Li, Yun
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 91 (05) : 794 - 808
  • [3] Comparison of Methods and Sampling Designs to Test for Association Between Rare Variants and Quantitative Traits
    Bacanu, Silviu-Alin
    Nelson, Matthew R.
    Whittaker, John C.
    [J]. GENETIC EPIDEMIOLOGY, 2011, 35 (04) : 226 - 235
  • [4] A Fast and Noise-Resilient Approach to Detect Rare-Variant Associations With Deep Sequencing Data for Complex Disorders
    Cheung, Yee Him
    Wang, Gao
    Leal, Suzanne M.
    Wang, Shuang
    [J]. GENETIC EPIDEMIOLOGY, 2012, 36 (07) : 675 - 685
  • [5] Multiple rare Alleles contribute to low plasma levels of HDL cholesterol
    Cohen, JC
    Kiss, RS
    Pertsemlidis, A
    Marcel, YL
    McPherson, R
    Hobbs, HH
    [J]. SCIENCE, 2004, 305 (5685) : 869 - 872
  • [6] Practical aspects of imputation-driven meta-analysis of genome-wide association studies
    de Bakker, Paul I. W.
    Ferreira, Manuel A. R.
    Jia, Xiaoming
    Neale, Benjamin M.
    Raychaudhuri, Soumya
    Voight, Benjamin F.
    [J]. HUMAN MOLECULAR GENETICS, 2008, 17 : R122 - R128
  • [7] Rare Variants Create Synthetic Genome-Wide Associations
    Dickson, Samuel P.
    Wang, Kai
    Krantz, Ian
    Hakonarson, Hakon
    Goldstein, David B.
    [J]. PLOS BIOLOGY, 2010, 8 (01)
  • [8] Least angle regression - Rejoinder
    Efron, B
    Hastie, T
    Johnstone, I
    Tibshirani, R
    [J]. ANNALS OF STATISTICS, 2004, 32 (02) : 494 - 499
  • [9] VIEWPOINT Missing heritability and strategies for finding the underlying causes of complex disease
    Eichler, Evan E.
    Flint, Jonathan
    Gibson, Greg
    Kong, Augustine
    Leal, Suzanne M.
    Moore, Jason H.
    Nadeau, Joseph H.
    [J]. NATURE REVIEWS GENETICS, 2010, 11 (06) : 446 - 450
  • [10] A Permutation Procedure to Correct for Confounders in Case-Control Studies, Including Tests of Rare Variation
    Epstein, Michael P.
    Duncan, Richard
    Jiang, Yunxuan
    Conneely, Karen N.
    Allen, Andrew S.
    Satten, Glen A.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 91 (02) : 215 - 223