Association studies for next-generation sequencing

Cited by: 71
Authors
Luo, Li [1 ]
Boerwinkle, Eric [1 ]
Xiong, Momiao [1 ]
Affiliations
[1] Univ Texas Houston, Sch Publ Hlth, Ctr Human Genet, Houston, TX 77030 USA
Funding
National Institutes of Health (USA)
Keywords
RARE VARIANTS; GENETIC-VARIATION; COMPLEX TRAITS; DISEASES; HAPLOTYPE;
DOI
10.1101/gr.115998.110
Chinese Library Classification (CLC) numbers
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
Genome-wide association studies (GWAS) have become the primary approach for identifying genes with common variants influencing complex diseases. Despite considerable progress, the common variants identified by GWAS account for only a small fraction of disease heritability and are unlikely to explain the majority of the phenotypic variation of common diseases. A potential source of the missing heritability is the contribution of rare variants. Next-generation sequencing technologies will detect millions of novel rare variants, but these technologies have three defining features: identification of a large number of rare variants, a high proportion of sequence errors, and a large proportion of missing data. These features raise challenges for testing the association of rare variants with phenotypes of interest. In this study, we use a genome continuum model and functional principal components as a general principle for developing novel and powerful association analysis methods designed for resequencing data. We use simulations to calculate the type I error rates and the power of nine alternative statistics: two functional principal component analysis (FPCA)-based statistics, the multivariate principal component analysis (MPCA)-based statistic, the weighted sum statistic (WSS), the variable-threshold (VT) method, the generalized T² statistic, the collapsing method, the CMC method, and individual χ² tests. We also examine the impact of sequence errors on their type I error rates. Finally, we apply the nine statistics to the published resequencing data set from ANGPTL4 in the Dallas Heart Study. We report that FPCA-based statistics have a higher power to detect association of rare variants and a stronger ability to filter sequence errors than the other seven methods.
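As an illustration of one of the simpler statistics named in the abstract, the following is a minimal sketch of the general idea behind a collapsing test: each individual is marked as a carrier if they hold a minor allele at any rare site in the region, and carrier status is then compared between cases and controls with a Pearson χ² statistic on the 2×2 table. This is not taken from the paper. The function names, the minor allele frequency cutoff of 0.01, and the toy genotypes are all assumptions made for illustration.

```python
def collapse_carriers(genotypes, mafs, maf_threshold=0.01):
    """Mark each individual 1 if they carry a minor allele at any rare site.

    genotypes: one row per individual, minor-allele counts (0, 1, 2) per site.
    mafs: minor allele frequency per site.
    maf_threshold: hypothetical cutoff defining "rare" (an assumption).
    """
    rare_sites = [j for j, f in enumerate(mafs) if f < maf_threshold]
    return [int(any(row[j] > 0 for j in rare_sites)) for row in genotypes]


def chi_square_2x2(carriers, is_case):
    """Pearson chi-square statistic (1 df, no continuity correction)."""
    table = [[0, 0], [0, 0]]  # table[case status][carrier status]
    for c, y in zip(carriers, is_case):
        table[y][c] += 1
    n = len(carriers)
    row_sums = [sum(r) for r in table]
    col_sums = [table[0][k] + table[1][k] for k in range(2)]
    if 0 in row_sums or 0 in col_sums:
        return 0.0  # degenerate table: no evidence either way
    stat = 0.0
    for i in range(2):
        for k in range(2):
            expected = row_sums[i] * col_sums[k] / n
            stat += (table[i][k] - expected) ** 2 / expected
    return stat


# Toy data (invented): 4 cases followed by 4 controls, 3 sites.
genotypes = [
    [1, 0, 0], [0, 1, 1], [0, 2, 0], [1, 0, 0],  # cases
    [0, 1, 0], [0, 0, 0], [0, 0, 1], [0, 2, 0],  # controls
]
mafs = [0.005, 0.2, 0.008]  # sites 0 and 2 fall below the 0.01 cutoff
is_case = [1, 1, 1, 1, 0, 0, 0, 0]

carriers = collapse_carriers(genotypes, mafs)
stat = chi_square_2x2(carriers, is_case)
print(carriers, stat)
```

In practice the statistic would be compared against a χ² distribution with one degree of freedom, or against a permutation distribution when counts are small. The sketch also treats every genotype call as correct, which is exactly the assumption the abstract says sequencing errors and missing data can violate.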
Pages: 1099-1108
Page count: 10