Simultaneous Variable and Covariance Selection With the Multivariate Spike-and-Slab LASSO

被引:39
作者
Deshpande, Sameer K. [1 ]
Rockova, Veronika [2 ]
George, Edward, I [3 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Univ Chicago, Booth Sch Business, Dept Econometr & Stat, Chicago, IL 60637 USA
[3] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
关键词
Bayesian shrinkage; EM algorithm; Gaussian graphical modeling; Multivariate regression; Nonconvex optimization; MAXIMUM-LIKELIHOOD-ESTIMATION; REGRESSION ADJUSTMENT; WISHART DISTRIBUTIONS; MATRIX ESTIMATION; BIAS; MODEL; REGULARIZATION; GENOMICS;
D O I
10.1080/10618600.2019.1593179
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a Bayesian procedure for simultaneous variable and covariance selection using continuous spike-and-slab priors in multivariate linear regression models where q possibly correlated responses are regressed onto p predictors. Rather than relying on a stochastic search through the high-dimensional model space, we develop an ECM algorithm similar to the EMVS procedure of Rockova and George targeting modal estimates of the matrix of regression coefficients and residual precision matrix. Varying the scale of the continuous spike densities facilitates dynamic posterior exploration and allows us to filter out negligible regression coefficients and partial covariances gradually. Our method is seen to substantially outperform regularization competitors on simulated data. We demonstrate our method with a re-examination of data from a recent observational study of the effect of playing high school football on several later-life cognition, psychological, and socio-economic outcomes. An R package, scripts for replicating examples in this article, and results from further simulation studies are provided in the available online.
引用
收藏
页码:921 / 931
页数:11
相关论文
共 50 条
[1]   Sparse time series chain graphical models for reconstructing genetic networks [J].
Abegaz, Fentaw ;
Wit, Ernst .
BIOSTATISTICS, 2013, 14 (03) :586-599
[2]  
[Anonymous], 2018, GLASSO GRAPHICAL LAS
[3]  
Banerjee O, 2008, J MACH LEARN RES, V9, P485
[4]   Bayesian structure learning in graphical models [J].
Banerjee, Sayantan ;
Ghosal, Subhashis .
JOURNAL OF MULTIVARIATE ANALYSIS, 2015, 136 :147-162
[5]   Joint High-Dimensional Bayesian Variable and Covariance Selection with an Application to eQTL Analysis [J].
Bhadra, Anindya ;
Mallick, Bani K. .
BIOMETRICS, 2013, 69 (02) :447-457
[6]   Predicting multivariate responses in multiple linear regression [J].
Breiman, L ;
Friedman, JH .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1997, 59 (01) :3-37
[7]   Multivariate Bayesian variable selection and prediction [J].
Brown, PJ ;
Vannucci, M ;
Fearn, T .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1998, 60 :627-641
[8]   Covariate-adjusted precision matrix estimation with an application in genetical genomics [J].
Cai, T. Tony ;
Li, Hongzhe ;
Liu, Weidong ;
Xie, Jichun .
BIOMETRIKA, 2013, 100 (01) :139-156
[9]   Simulation of hyper-inverse Wishart distributions in graphical models [J].
Carvalho, Carlos M. ;
Massam, Helene ;
West, Mike .
BIOMETRIKA, 2007, 94 (03) :647-659
[10]  
COCHRAN WG, 1973, SANKHYA SER A, V35, P417