Quantitative trait locus analysis for next-generation sequencing with the functional linear models

被引:37
作者
Luo, Li [2 ]
Zhu, Yun [1 ]
Xiong, Momiao [1 ]
机构
[1] Univ Texas Houston, Sch Publ Hlth, Ctr Human Genet, Dept Biostat, Houston, TX USA
[2] Univ New Mexico, Div Epidemiol Biostat & Prevent Med, Albuquerque, NM 87131 USA
基金
美国国家卫生研究院;
关键词
RARE VARIANTS; GENETIC ASSOCIATION; SCALAR RESPONSE; POPULATION; COMMON; REGRESSION; DISEASES; READS; TESTS; VIEW;
D O I
10.1136/jmedgenet-2012-100798
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background Although in the past few years we have witnessed the rapid development of novel statistical methods for association studies of qualitative traits using next generation sequencing (NGS) data, only a few statistics are proposed for testing the association of rare variants with quantitative traits. The quantitative trait locus (QTL) analysis of rare variants remains challenging. Analysis from low dimensional data to high dimensional genomic data demands changes in statistical methods from multivariate data analysis to functional data analysis. Methods We propose a functional linear model (FLM) as a general principle for developing novel and powerful QTL analysis methods designed for resequencing data. By simulations we calculated the type I error rates and evaluated the power of the FLM and other eight existing statistical methods, even in the presence of both positive and negative signs of effects. Results Since the FLM retains all of the genetic information in the data and explores the merits of both variant-by-variant and collective analysis and overcomes their limitation, the FLM has a much higher power than other existing statistics in all the scenarios considered. To evaluate its performance further, the FLM was applied to association analysis of six quantitative traits in the Dallas Heart Study, and RNA-seq eQTL analysis with genetic variation in the low coverage resequencing data of the 1000 Genomes Project. Real data analysis showed that the FLM had much smaller p values to identify significantly associated variants than other existing methods. Conclusions The FLM is expected to open a new route for QTL analysis.
引用
收藏
页码:513 / 524
页数:12
相关论文
共 50 条
  • [1] Ando T, 2004, LECT NOTES ARTIF INT, V3303, P69
  • [2] Comparison of Methods and Sampling Designs to Test for Association Between Rare Variants and Quantitative Traits
    Bacanu, Silviu-Alin
    Nelson, Matthew R.
    Whittaker, John C.
    [J]. GENETIC EPIDEMIOLOGY, 2011, 35 (04) : 226 - 235
  • [3] Local linear regression for functional predictor and scalar response
    Baillo, Amparo
    Grane, Aurea
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2009, 100 (01) : 102 - 111
  • [4] Statistical analysis strategies for association studies involving rare variants
    Bansal, Vikas
    Libiger, Ondrej
    Torkamani, Ali
    Schork, Nicholas J.
    [J]. NATURE REVIEWS GENETICS, 2010, 11 (11) : 773 - 785
  • [5] Autoregressive forecasting of some functional climatic variations
    Besse, PC
    Cardot, H
    Stephenson, DB
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2000, 27 (04) : 673 - 687
  • [6] Functional linear model
    Cardot, H
    Ferraty, F
    Sarda, P
    [J]. STATISTICS & PROBABILITY LETTERS, 1999, 45 (01) : 11 - 22
  • [7] Testing hypotheses in the functional linear model
    Cardot, H
    Ferraty, F
    Mas, A
    Sarda, P
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2003, 30 (01) : 241 - 255
  • [8] De novo fragment assembly with short mate-paired reads: Does the read length matter?
    Chaisson, Mark J.
    Brinza, Dumitru
    Pevzner, Pavel A.
    [J]. GENOME RESEARCH, 2009, 19 (02) : 336 - 346
  • [9] Common functional principal components analysis: A new approach to analyzing human movement data
    Coffey, N.
    Harrison, A. J.
    Donoghue, O. A.
    Hayes, K.
    [J]. HUMAN MOVEMENT SCIENCE, 2011, 30 (06) : 1144 - 1166
  • [10] Human Genome Sequencing Using Unchained Base Reads on Self-Assembling DNA Nanoarrays
    Drmanac, Radoje
    Sparks, Andrew B.
    Callow, Matthew J.
    Halpern, Aaron L.
    Burns, Norman L.
    Kermani, Bahram G.
    Carnevali, Paolo
    Nazarenko, Igor
    Nilsen, Geoffrey B.
    Yeung, George
    Dahl, Fredrik
    Fernandez, Andres
    Staker, Bryan
    Pant, Krishna P.
    Baccash, Jonathan
    Borcherding, Adam P.
    Brownley, Anushka
    Cedeno, Ryan
    Chen, Linsu
    Chernikoff, Dan
    Cheung, Alex
    Chirita, Razvan
    Curson, Benjamin
    Ebert, Jessica C.
    Hacker, Coleen R.
    Hartlage, Robert
    Hauser, Brian
    Huang, Steve
    Jiang, Yuan
    Karpinchyk, Vitali
    Koenig, Mark
    Kong, Calvin
    Landers, Tom
    Le, Catherine
    Liu, Jia
    McBride, Celeste E.
    Morenzoni, Matt
    Morey, Robert E.
    Mutch, Karl
    Perazich, Helena
    Perry, Kimberly
    Peters, Brock A.
    Peterson, Joe
    Pethiyagoda, Charit L.
    Pothuraju, Kaliprasad
    Richter, Claudia
    Rosenbaum, Abraham M.
    Roy, Shaunak
    Shafto, Jay
    Sharanhovich, Uladzislau
    [J]. SCIENCE, 2010, 327 (5961) : 78 - 81