Association studies for next-generation sequencing

被引:71
作者
Luo, Li [1 ]
Boerwinkle, Eric [1 ]
Xiong, Momiao [1 ]
机构
[1] Univ Texas Houston, Sch Publ Hlth, Ctr Human Genet, Houston, TX 77030 USA
基金
美国国家卫生研究院;
关键词
RARE VARIANTS; GENETIC-VARIATION; COMPLEX TRAITS; DISEASES; HAPLOTYPE;
D O I
10.1101/gr.115998.110
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genome-wide association studies (GWAS) have become the primary approach for identifying genes with common variants influencing complex diseases. Despite considerable progress, the common variations identified by GWAS account for only a small fraction of disease heritability and are unlikely to explain the majority of phenotypic variations of common diseases. A potential source of the missing heritability is the contribution of rare variants. Next-generation sequencing technologies will detect millions of novel rare variants, but these technologies have three defining features: identification of a large number of rare variants, a high proportion of sequence errors, and a large proportion of missing data. These features raise challenges for testing the association of rare variants with phenotypes of interest. In this study, we use a genome continuum model and functional principal components as a general principle for developing novel and powerful association analysis methods designed for resequencing data. We use simulations to calculate the type I error rates and the power of nine alternative statistics: two functional principal component analysis (FPCA)-based statistics, the multivariate principal component analysis (MPCA)-based statistic, the weighted sum (WSS), the variable-threshold (VT) method, the generalized T-2, the collapsing method, the CMC method, and individual chi(2) tests. We also examined the impact of sequence errors on their type I error rates. Finally, we apply the nine statistics to the published resequencing data set from ANGPTL4 in the Dallas Heart Study. We report that FPCA-based statistics have a higher power to detect association of rare variants and a stronger ability to filter sequence errors than the other seven methods.
引用
收藏
页码:1099 / 1108
页数:10
相关论文
共 31 条
[1]  
[Anonymous], 1990, Variational Methods: Applications to Non-linear Partial Differential Equations and Hamiltonian Systems
[2]   Statistical analysis strategies for association studies involving rare variants [J].
Bansal, Vikas ;
Libiger, Ondrej ;
Torkamani, Ali ;
Schork, Nicholas J. .
NATURE REVIEWS GENETICS, 2010, 11 (11) :773-785
[3]   Accurate detection and genotyping of SNPs utilizing population sequencing data [J].
Bansal, Vikas ;
Harismendy, Olivier ;
Tewhey, Ryan ;
Murray, Sarah S. ;
Schork, Nicholas J. ;
Topol, Eric J. ;
Frazer, Kelly A. .
GENOME RESEARCH, 2010, 20 (04) :537-545
[4]  
Bickeboller H, 1996, GENETICS, V143, P1043
[5]   De novo fragment assembly with short mate-paired reads: Does the read length matter? [J].
Chaisson, Mark J. ;
Brinza, Dumitru ;
Pevzner, Pavel A. .
GENOME RESEARCH, 2009, 19 (02) :336-346
[6]   Multiple rare variants in NPC1L1 associated with reduced sterol absorption and plasma low-density lipoprotein levels [J].
Cohen, JC ;
Pertsemlidis, A ;
Fahmi, S ;
Esmail, S ;
Vega, GL ;
Grundy, SM ;
Hobbs, HH .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (06) :1810-1815
[7]   Rare Variants Create Synthetic Genome-Wide Associations [J].
Dickson, Samuel P. ;
Wang, Kai ;
Krantz, Ian ;
Hakonarson, Hakon ;
Goldstein, David B. .
PLOS BIOLOGY, 2010, 8 (01)
[8]   Human genetic variation and its contribution to complex traits [J].
Frazer, Kelly A. ;
Murray, Sarah S. ;
Schork, Nicholas J. ;
Topol, Eric J. .
NATURE REVIEWS GENETICS, 2009, 10 (04) :241-251
[9]   Shifting paradigm of association studies: Value of rare single-nucleotide polymorphisms [J].
Gorlov, Ivan P. ;
Gorlova, Olga Y. ;
Sunyaev, Shamil R. ;
Spitz, Margaret R. ;
Amos, Christopher I. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 82 (01) :100-112
[10]   Evaluation of next generation sequencing platforms for population targeted sequencing studies [J].
Harismendy, Olivier ;
Ng, Pauline C. ;
Strausberg, Robert L. ;
Wang, Xiaoyun ;
Stockwell, Timothy B. ;
Beeson, Karen Y. ;
Schork, Nicholas J. ;
Murray, Sarah S. ;
Topol, Eric J. ;
Levy, Samuel ;
Frazer, Kelly A. .
GENOME BIOLOGY, 2009, 10 (03)