Spike-and-Slab Shrinkage Priors for Structurally Sparse Bayesian Neural Networks

被引:0
|
作者
Jantre, Sanket [1 ]
Bhattacharya, Shrijita [2 ]
Maiti, Tapabrata [2 ]
机构
[1] Michigan State Univ, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Stat & Probabil, E Lansing, MI USA
关键词
Bayes methods; Neurons; Biological neural networks; Vectors; Slabs; Predictive models; Linear regression; Computational efficiency; Training; Network topology; Bayesian neural networks (BNNs); posterior consistency; spike-and-slab (SS) priors; structured sparsity; variational inference; POSTERIOR CONSISTENCY; HORSESHOE; MODEL; RELU;
D O I
10.1109/TNNLS.2024.3485529
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Network complexity and computational efficiency have become increasingly significant aspects of deep learning. Sparse deep learning addresses these challenges by recovering a sparse representation of the underlying target function by reducing heavily overparameterized deep neural networks. Specifically, deep neural architectures compressed via structured sparsity (e.g., node sparsity) provide low-latency inference, higher data throughput, and reduced energy consumption. In this article, we explore two well-established shrinkage techniques, Lasso and Horseshoe, for model compression in Bayesian neural networks (BNNs). To this end, we propose structurally sparse BNNs, which systematically prune excessive nodes with the following: 1) spike-and-slab group Lasso (SS-GL) and 2) SS group Horseshoe (SS-GHS) priors, and develop computationally tractable variational inference, including continuous relaxation of Bernoulli variables. We establish the contraction rates of the variational posterior of our proposed models as a function of the network topology, layerwise node cardinalities, and bounds on the network weights. We empirically demonstrate the competitive performance of our models compared with the baseline models in prediction accuracy, model compression, and inference latency.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] An Adaptive Bayesian Lasso Approach with Spike-and-Slab Priors to Identify Multiple Linear and Nonlinear Effects in Structural Equation Models
    Brandt, Holger
    Cambria, Jenna
    Kelava, Augustin
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2018, 25 (06) : 946 - 960
  • [32] Bayesian inference with spike-and-slab priors for differential item functioning detection in a multiple-group IRT tree model
    Chang, Yu-Wei
    Yang, Cheng-Xin
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (07) : 1416 - 1444
  • [33] REDOMA: Bayesian random-effects dose-optimization meta-analysis using spike-and-slab priors
    Yang, Cheng-Han
    Kwiatkowski, Evan
    Lee, J. Jack
    Lin, Ruitao
    STATISTICS IN MEDICINE, 2024, 43 (18) : 3484 - 3502
  • [34] Comparing Spike and Slab Priors for Bayesian Variable Selection
    Malsiner-Walli, Gertraud
    Wagner, Helga
    AUSTRIAN JOURNAL OF STATISTICS, 2011, 40 (04) : 241 - 264
  • [35] MULTI-TASK IMAGE CLASSIFICATION VIA COLLABORATIVE, HIERARCHICAL SPIKE-AND-SLAB PRIORS
    Mousavi, Hojjat S.
    Srinivas, Umamahesh
    Monga, Vishal
    Suo, Yuanming
    Dao, Minh
    Tran, Trac D.
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4236 - 4240
  • [36] Spike-and-Slab Group Lassos for Grouped Regression and Sparse Generalized Additive Models
    Bai, Ray
    Moran, Gemma E.
    Antonelli, Joseph L.
    Chen, Yong
    Boland, Mary R.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (537) : 184 - 197
  • [37] Disjunct support spike-and-slab priors for variable selection in regression under quasi-sparseness
    Andrade, Daniel
    Fukumizu, Kenji
    STAT, 2020, 9 (01):
  • [38] Spike and slab Bayesian sparse principal component analysis
    Ning, Yu-Chien Bo
    Ning, Ning
    STATISTICS AND COMPUTING, 2024, 34 (03)
  • [39] Bayesian Quantile Regression Based on the Empirical Likelihood with Spike and Slab Priors
    Xi, Ruibin
    Li, Yunxiao
    Hu, Yiming
    BAYESIAN ANALYSIS, 2016, 11 (03): : 821 - 855
  • [40] Variable selection and estimation in causal inference using Bayesian spike and slab priors
    Koch, Brandon
    Vock, David M.
    Wolfson, Julian
    Vock, Laura Boehm
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (09) : 2445 - 2469