Bayesian network-guided sparse regression with flexible varying effects

被引:0
作者
Ren, Yangfan [1 ]
Peterson, Christine B. [2 ]
Vannucci, Marina [1 ]
机构
[1] Rice Univ, Dept Stat, 6100 Main St, Houston, TX 77005 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
关键词
Bayesian variable selection; Gaussian process prior; graphical model; spike-and-slab prior; varying coefficient model; VARIABLE SELECTION; MODELS; MICROBIOTA; PRIORS; DIET;
D O I
10.1093/biomtc/ujae111
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, we propose Varying Effects Regression with Graph Estimation (VERGE), a novel Bayesian method for feature selection in regression. Our model has key aspects that allow it to leverage the complex structure of data sets arising from genomics or imaging studies. We distinguish between the predictors, which are the features utilized in the outcome prediction model, and the subject-level covariates, which modulate the effects of the predictors on the outcome. We construct a varying coefficients modeling framework where we infer a network among the predictor variables and utilize this network information to encourage the selection of related predictors. We employ variable selection spike-and-slab priors that enable the selection of both network-linked predictor variables and covariates that modify the predictor effects. We demonstrate through simulation studies that our method outperforms existing alternative methods in terms of both feature selection and predictive accuracy. We illustrate VERGE with an application to characterizing the influence of gut microbiome features on obesity, where we identify a set of microbial taxa and their ecological dependence relations. We allow subject-level covariates, including sex and dietary intake variables to modify the coefficients of the microbiome predictors, providing additional insight into the interplay between these factors.
引用
收藏
页数:10
相关论文
共 39 条
[21]   Sex-specific association between gut microbiome and fat distribution [J].
Min, Yan ;
Ma, Xiaoguang ;
Sankaran, Kris ;
Ru, Yuan ;
Chen, Lijin ;
Baiocchi, Mike ;
Zhu, Shankuan .
NATURE COMMUNICATIONS, 2019, 10 (1)
[22]  
Neal RM, 1999, BAYESIAN STATISTICS 6, P475
[23]   Bayesian Hierarchical Varying-Sparsity Regression Models with Application to Cancer Proteogenomics [J].
Ni, Yang ;
Stingo, Francesco C. ;
Ha, Min Jin ;
Akbani, Rehan ;
Baladandayuthapani, Veerabhadran .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (525) :48-60
[24]   A taxonomic signature of obesity in a large study of American adults [J].
Peters, Brandilyn A. ;
Shapiro, Jean A. ;
Church, Timothy R. ;
Miller, George ;
Trinh-Shevrin, Chau ;
Yuen, Elizabeth ;
Friedlander, Charles ;
Hayes, Richard B. ;
Ahn, Jiyoung .
SCIENTIFIC REPORTS, 2018, 8
[25]   Joint Bayesian variable and graph selection for regression models with network-structured predictors [J].
Peterson, Christine B. ;
Stingo, Francesco C. ;
Vannucci, Marina .
STATISTICS IN MEDICINE, 2016, 35 (07) :1017-1031
[26]   Gut Microbiome Composition in Obese and Non-Obese Persons: A Systematic Review and Meta-Analysis [J].
Pinart, Mariona ;
Doetsch, Andreas ;
Schlicht, Kristina ;
Laudes, Matthias ;
Bouwman, Jildau ;
Forslund, Sofia K. ;
Pischon, Tobias ;
Nimptsch, Katharina .
NUTRIENTS, 2022, 14 (01)
[27]   Bayesian Variable Selection for Multivariate Spatially Varying Coefficient Regression [J].
Reich, Brian J. ;
Fuentes, Montserrat ;
Herring, Amy H. ;
Evenson, Kelly R. .
BIOMETRICS, 2010, 66 (03) :772-782
[28]   Variable Selection for Nonparametric Gaussian Process Priors: Models and Computational Strategies [J].
Savitsky, Terrance ;
Vannucci, Marina ;
Sha, Naijun .
STATISTICAL SCIENCE, 2011, 26 (01) :130-149
[29]   Spike-and-Slab Priors for Function Selection in Structured Additive Regression Models [J].
Scheipl, Fabian ;
Fahrmeir, Ludwig ;
Kneib, Thomas .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (500) :1518-1532
[30]  
Seeger Matthias, 2004, Int J Neural Syst, V14, P69, DOI 10.1142/S0129065704001899