Association studies for next-generation sequencing

被引:71
作者
Luo, Li [1 ]
Boerwinkle, Eric [1 ]
Xiong, Momiao [1 ]
机构
[1] Univ Texas Houston, Sch Publ Hlth, Ctr Human Genet, Houston, TX 77030 USA
基金
美国国家卫生研究院;
关键词
RARE VARIANTS; GENETIC-VARIATION; COMPLEX TRAITS; DISEASES; HAPLOTYPE;
D O I
10.1101/gr.115998.110
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genome-wide association studies (GWAS) have become the primary approach for identifying genes with common variants influencing complex diseases. Despite considerable progress, the common variations identified by GWAS account for only a small fraction of disease heritability and are unlikely to explain the majority of phenotypic variations of common diseases. A potential source of the missing heritability is the contribution of rare variants. Next-generation sequencing technologies will detect millions of novel rare variants, but these technologies have three defining features: identification of a large number of rare variants, a high proportion of sequence errors, and a large proportion of missing data. These features raise challenges for testing the association of rare variants with phenotypes of interest. In this study, we use a genome continuum model and functional principal components as a general principle for developing novel and powerful association analysis methods designed for resequencing data. We use simulations to calculate the type I error rates and the power of nine alternative statistics: two functional principal component analysis (FPCA)-based statistics, the multivariate principal component analysis (MPCA)-based statistic, the weighted sum (WSS), the variable-threshold (VT) method, the generalized T-2, the collapsing method, the CMC method, and individual chi(2) tests. We also examined the impact of sequence errors on their type I error rates. Finally, we apply the nine statistics to the published resequencing data set from ANGPTL4 in the Dallas Heart Study. We report that FPCA-based statistics have a higher power to detect association of rare variants and a stronger ability to filter sequence errors than the other seven methods.
引用
收藏
页码:1099 / 1108
页数:10
相关论文
共 31 条
  • [1] [Anonymous], 1990, Variational Methods: Applications to Non-linear Partial Differential Equations and Hamiltonian Systems
  • [2] Statistical analysis strategies for association studies involving rare variants
    Bansal, Vikas
    Libiger, Ondrej
    Torkamani, Ali
    Schork, Nicholas J.
    [J]. NATURE REVIEWS GENETICS, 2010, 11 (11) : 773 - 785
  • [3] Accurate detection and genotyping of SNPs utilizing population sequencing data
    Bansal, Vikas
    Harismendy, Olivier
    Tewhey, Ryan
    Murray, Sarah S.
    Schork, Nicholas J.
    Topol, Eric J.
    Frazer, Kelly A.
    [J]. GENOME RESEARCH, 2010, 20 (04) : 537 - 545
  • [4] Bickeboller H, 1996, GENETICS, V143, P1043
  • [5] De novo fragment assembly with short mate-paired reads: Does the read length matter?
    Chaisson, Mark J.
    Brinza, Dumitru
    Pevzner, Pavel A.
    [J]. GENOME RESEARCH, 2009, 19 (02) : 336 - 346
  • [6] Multiple rare variants in NPC1L1 associated with reduced sterol absorption and plasma low-density lipoprotein levels
    Cohen, JC
    Pertsemlidis, A
    Fahmi, S
    Esmail, S
    Vega, GL
    Grundy, SM
    Hobbs, HH
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (06) : 1810 - 1815
  • [7] Rare Variants Create Synthetic Genome-Wide Associations
    Dickson, Samuel P.
    Wang, Kai
    Krantz, Ian
    Hakonarson, Hakon
    Goldstein, David B.
    [J]. PLOS BIOLOGY, 2010, 8 (01)
  • [8] Human genetic variation and its contribution to complex traits
    Frazer, Kelly A.
    Murray, Sarah S.
    Schork, Nicholas J.
    Topol, Eric J.
    [J]. NATURE REVIEWS GENETICS, 2009, 10 (04) : 241 - 251
  • [9] Shifting paradigm of association studies: Value of rare single-nucleotide polymorphisms
    Gorlov, Ivan P.
    Gorlova, Olga Y.
    Sunyaev, Shamil R.
    Spitz, Margaret R.
    Amos, Christopher I.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 82 (01) : 100 - 112
  • [10] Evaluation of next generation sequencing platforms for population targeted sequencing studies
    Harismendy, Olivier
    Ng, Pauline C.
    Strausberg, Robert L.
    Wang, Xiaoyun
    Stockwell, Timothy B.
    Beeson, Karen Y.
    Schork, Nicholas J.
    Murray, Sarah S.
    Topol, Eric J.
    Levy, Samuel
    Frazer, Kelly A.
    [J]. GENOME BIOLOGY, 2009, 10 (03):