Semiparametric Kernel-Based Regression for Evaluating Interaction Between Pathway Effect and Covariate

被引:1
作者
Fang, Zaili [1 ]
Kim, Inyoung [1 ]
Jung, Jeesun [2 ]
机构
[1] Virginia Polytech Inst & State Univ, Dept Stat, 410A Hutcheson Hall, Blacksburg, VA 24061 USA
[2] NIAAA, Lab Epidemiol & Biometry, NIH, Bethesda, MD USA
基金
美国国家科学基金会;
关键词
Gaussian random process; Kernel machine; Pathway analysis; Semiparametric model; Smoothing splines; LIKELIHOOD RATIO TESTS; OXIDATIVE-PHOSPHORYLATION; CLINICAL-OUTCOMES; MIXED MODELS; ASSOCIATION; GENES; VARIANCE; POWERFUL; MACHINES; SPLINES;
D O I
10.1007/s13253-017-0317-2
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Pathway-based analysis has the ability to detect subtle changes in response variables that could be missed when using gene-based analysis. Since genes interact with other covariates such as environmental or clinical variables, so do pathways, which are sets of genes that serve particular cellular or physiological functions. However, since pathways are sets of genes and since environmental or clinical variables do not have parametric relationships with response variables, it is difficult to model unknown interaction terms between high-dimensional variables and low-dimensional variables as environmental or clinical variables. In this paper, we propose a semiparametric interaction model for two unknown functions to evaluate the interaction between a pathway and environmental or clinical variable: for the pathway, we use an unknown high-dimensional function, and for environmental or clinical variable, we use an unknown low-dimensional function. We model the environmental or clinical variable nonparametrically via a natural cubic spline. We model both the pathway effect and the interaction between the pathway and environmental or clinical effect nonparametrically via a kernel machine. Since both interactions among genes within the same pathway and the interaction between the pathway and the environmental or clinical variables are complex, we allow for the possibility that a pathway is interacting with environmental or clinical variables and the genes within the same pathway are interacting with each other. We illustrate our approach using simulated data and genetic pathway data for type II diabetes. Supplementary materials accompanying this paper appear online.
引用
收藏
页码:129 / 152
页数:24
相关论文
共 32 条
[1]  
[Anonymous], 2006, GAUSSIAN PROCESSES M, DOI DOI 10.1142/S0129065704001899
[2]   THEORY OF REPRODUCING KERNELS [J].
ARONSZAJN, N .
TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1950, 68 (MAY) :337-404
[3]   Bayesian Semiparametric Model for Pathway-Based Analysis with Zero-Inflated Clinical Outcomes [J].
Cheng, Lulu ;
Kim, Inyoung ;
Pang, Herbert .
JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2016, 21 (04) :641-662
[4]   Restricted likelihood ratio lack-of-fit tests using mixed spline models [J].
Claeskens, G .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2004, 66 :909-926
[5]   Exact likelihood ratio tests for penalised splines [J].
Crainiceanu, C ;
Ruppert, D ;
Claeskens, G ;
Wand, MP .
BIOMETRIKA, 2005, 92 (01) :91-103
[6]  
Czyzyk A, 1989, Pol Arch Med Wewn, V81, P193
[7]   Flexible Variable Selection for Recovering Sparsity in Nonadditive Nonparametric Models [J].
Fang, Zaili ;
Kim, Inyoung ;
Schaumont, Patrick .
BIOMETRICS, 2016, 72 (04) :1155-1163
[8]   Taurine supplementation and diabetes mellitus [J].
Franconi, F ;
Loizzo, A ;
Ghirlanda, G ;
Seghieri, G .
CURRENT OPINION IN CLINICAL NUTRITION AND METABOLIC CARE, 2006, 9 (01) :32-36
[9]   A global test for groups of genes: testing association with a clinical outcome [J].
Goeman, JJ ;
van de Geer, SA ;
de Kort, F ;
van Houwelingen, HC .
BIOINFORMATICS, 2004, 20 (01) :93-99
[10]  
Green PJ, 1993, NONPARAMETRIC REGRES, DOI DOI 10.1007/978-1-4899-4473-3