Bayesian network-guided sparse regression with flexible varying effects

被引:0
作者
Ren, Yangfan [1 ]
Peterson, Christine B. [2 ]
Vannucci, Marina [1 ]
机构
[1] Rice Univ, Dept Stat, 6100 Main St, Houston, TX 77005 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
关键词
Bayesian variable selection; Gaussian process prior; graphical model; spike-and-slab prior; varying coefficient model; VARIABLE SELECTION; MODELS; MICROBIOTA; PRIORS; DIET;
D O I
10.1093/biomtc/ujae111
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, we propose Varying Effects Regression with Graph Estimation (VERGE), a novel Bayesian method for feature selection in regression. Our model has key aspects that allow it to leverage the complex structure of data sets arising from genomics or imaging studies. We distinguish between the predictors, which are the features utilized in the outcome prediction model, and the subject-level covariates, which modulate the effects of the predictors on the outcome. We construct a varying coefficients modeling framework where we infer a network among the predictor variables and utilize this network information to encourage the selection of related predictors. We employ variable selection spike-and-slab priors that enable the selection of both network-linked predictor variables and covariates that modify the predictor effects. We demonstrate through simulation studies that our method outperforms existing alternative methods in terms of both feature selection and predictive accuracy. We illustrate VERGE with an application to characterizing the influence of gut microbiome features on obesity, where we identify a set of microbial taxa and their ecological dependence relations. We allow subject-level covariates, including sex and dietary intake variables to modify the coefficients of the microbiome predictors, providing additional insight into the interplay between these factors.
引用
收藏
页数:10
相关论文
共 39 条
[1]  
AITCHISON J, 1982, J ROY STAT SOC B, V44, P139
[2]  
[Anonymous], 1991, Stat Comput, DOI DOI 10.1007/BF01890836
[3]   Optimal predictive model selection [J].
Barbieri, MM ;
Berger, JO .
ANNALS OF STATISTICS, 2004, 32 (03) :870-897
[4]   Tree-based varying coefficient regression for longitudinal ordinal responses [J].
Buergin, Reto ;
Ritschard, Gilbert .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 86 :65-80
[5]   The role of short-chain fatty acids in the interplay between diet, gut microbiota, and host energy metabolism [J].
den Besten, Gijs ;
van Eunen, Karen ;
Groen, Albert K. ;
Venema, Koen ;
Reijngoud, Dirk-Jan ;
Bakker, Barbara M. .
JOURNAL OF LIPID RESEARCH, 2013, 54 (09) :2325-2340
[6]   The gut microbiome: Relationships with disease and opportunities for therapy [J].
Durack, Juliana ;
Lynch, Susan V. .
JOURNAL OF EXPERIMENTAL MEDICINE, 2019, 216 (01) :20-40
[7]  
George EI, 1997, STAT SINICA, V7, P339
[8]   Compositional zero-inflated network estimation for microbiome data [J].
Ha, Min Jin ;
Kim, Junghi ;
Galloway-Pena, Jessica ;
Kim-Anh Do ;
Peterson, Christine B. .
BMC BIOINFORMATICS, 2020, 21 (Suppl 21)
[9]  
HASTIE T, 1993, J ROY STAT SOC B MET, V55, P757
[10]   svReg: Structural varying-coefficient regression to differentiate how regional brain atrophy affects motor impairment for Huntington disease severity groups [J].
Kim, Rakheon ;
Mueller, Samuel ;
Garcia, Tanya P. .
BIOMETRICAL JOURNAL, 2021, 63 (06) :1254-1271