A functional U-statistic method for association analysis of sequencing data

被引:3
作者
Jadhav, Sneha [1 ]
Tong, Xiaoran [2 ]
Lu, Qing [2 ]
机构
[1] Michigan State Univ, Dept Stat & Probabil, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Epidemiol & Biostat, 909 Fee Rd,Room 601, E Lansing, MI 48824 USA
关键词
Functional data analysis; multivariate method; nonparametric method; similarity measure; GENOME-WIDE ASSOCIATION; GENE-ENVIRONMENT INTERACTIONS; RISK-FACTORS; ATHEROSCLEROSIS RISK; MODEL; LOCI; METAANALYSIS; TRAITS; ZFHX3; EPIDEMIOLOGY;
D O I
10.1002/gepi.22063
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Although sequencing studies hold great promise for uncovering novel variants predisposing to human diseases, the high dimensionality of the sequencing data brings tremendous challenges to data analysis. Moreover, for many complex diseases (e.g., psychiatric disorders) multiple related phenotypes are collected. These phenotypes can be different measurements of an underlying disease, or measurements characterizing multiple related diseases for studying common genetic mechanism. Although jointly analyzing these phenotypes could potentially increase the power of identifying disease-associated genes, the different types of phenotypes pose challenges for association analysis. To address these challenges, we propose a nonparametric method, functional U-statistic method (FU), for multivariate analysis of sequencing data. It first constructs smooth functions from individuals' sequencing data, and then tests the association of these functions with multiple phenotypes by using a U-statistic. The method provides a general framework for analyzing various types of phenotypes (e.g., binary and continuous phenotypes) with unknown distributions. Fitting the genetic variants within a gene using a smoothing function also allows us to capture complexities of gene structure (e.g., linkage disequilibrium, LD), which could potentially increase the power of association analysis. Through simulations, we compared our method to themultivariate outcome score test (MOST), and found that our test attained better performance than MOST. In a real data application, we apply our method to the sequencing data from Minnesota Twin Study (MTS) and found potential associations of several nicotine receptor subunit (CHRN) genes, including CHRNB3, associated with nicotine dependence and/or alcohol dependence.
引用
收藏
页码:636 / 643
页数:8
相关论文
共 74 条
[1]   NONPARAMETRIC INFERENCE FOR A FAMILY OF COUNTING PROCESSES [J].
AALEN, O .
ANNALS OF STATISTICS, 1978, 6 (04) :701-726
[2]  
Akaike H., 1998, Selected papers of Hirotugu Akaike, P199, DOI [10.1007/978-1-4612-1694-0_15, DOI 10.1007/978-1-4612-1694-0_15]
[3]   Joint analyses of longitudinal and time-to-event data in research on aging: implications for predicting health and survival [J].
Arbeev, Konstantin G. ;
Akushevich, Igor ;
Kulminski, Alexander M. ;
Ukraintseva, Svetlana V. ;
Yashin, Anatoliy I. .
FRONTIERS IN PUBLIC HEALTH, 2014, 2
[4]   Genetic model for longitudinal studies of aging, health, and longevity and its potential application to incomplete data [J].
Arbeev, Konstantin G. ;
Akushevich, Igor ;
Kulminski, Alexander M. ;
Arbeeva, Liubov S. ;
Akushevich, Lucy ;
Ukraintseva, Svetlana V. ;
Culminskaya, Irina V. ;
Yashin, Anatoli I. .
JOURNAL OF THEORETICAL BIOLOGY, 2009, 258 (01) :103-111
[5]  
Ascher U.M., 1998, Computer Methods for Ordinary Differential Equations and Differential-Algebraic Equations, P3, DOI [DOI 10.1137/1.9781611971392, 10.1137/1.9781611971392]
[6]   Genome-Wide Association of Lipid-Lowering Response to Statins in Combined Study Populations [J].
Barber, Mathew J. ;
Mangravite, Lara M. ;
Hyde, Craig L. ;
Chasman, Daniel I. ;
Smith, Joshua D. ;
McCarty, Catherine A. ;
Li, Xiaohui ;
Wilke, Russell A. ;
Rieder, Mark J. ;
Williams, Paul T. ;
Ridker, Paul M. ;
Chatterjee, Aurobindo ;
Rotter, Jerome I. ;
Nickerson, Deborah A. ;
Stephens, Matthew ;
Krauss, Ronald M. .
PLOS ONE, 2010, 5 (03)
[7]   Gene-environment interaction research in psychiatric epidemiology: a framework and implications for study design [J].
Belsky, Daniel W. ;
Suppli, Nis Palm ;
Israel, Salomon .
SOCIAL PSYCHIATRY AND PSYCHIATRIC EPIDEMIOLOGY, 2014, 49 (10) :1525-1529
[8]   A genome-wide association meta-analysis identifies new childhood obesity loci [J].
Bradfield, Jonathan P. ;
Taal, H. Rob ;
Timpson, Nicholas J. ;
Scherag, Andre ;
Lecoeur, Cecile ;
Warrington, Nicole M. ;
Hypponen, Elina ;
Holst, Claus ;
Valcarcel, Beatriz ;
Thiering, Elisabeth ;
Salem, Rany M. ;
Schumacher, Fredrick R. ;
Cousminer, Diana L. ;
Sleiman, Patrick M. A. ;
Zhao, Jianhua ;
Berkowitz, Robert I. ;
Vimaleswaran, Karani S. ;
Jarick, Ivonne ;
Pennell, Craig E. ;
Evans, David M. ;
St Pourcain, Beate ;
Berry, Diane J. ;
Mook-Kanamori, Dennis O. ;
Hofman, Albert ;
Rivadeneira, Fernando ;
Uitterlinden, Andre G. ;
van Duijn, Cornelia M. ;
van der Valk, Ralf J. P. ;
de Jongste, Johan C. ;
Postma, Dirkje S. ;
Boomsma, Dorret I. ;
Gauderman, W. James ;
Hassanein, Mohamed T. ;
Lindgren, Cecilia M. ;
Magi, Reedik ;
Boreham, Colin A. G. ;
Neville, Charlotte E. ;
Moreno, Luis A. ;
Elliott, Paul ;
Pouta, Anneli ;
Hartikainen, Anna-Liisa ;
Li, Mingyao ;
Raitakari, Olli ;
Lehtimaki, Terho ;
Eriksson, Johan G. ;
Palotie, Aarno ;
Dallongeville, Jean ;
Das, Shikta ;
Deloukas, Panos ;
McMahon, George .
NATURE GENETICS, 2012, 44 (05) :526-+
[9]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[10]   A LIMITED MEMORY ALGORITHM FOR BOUND CONSTRAINED OPTIMIZATION [J].
BYRD, RH ;
LU, PH ;
NOCEDAL, J ;
ZHU, CY .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1995, 16 (05) :1190-1208