Spike-and-Slab Shrinkage Priors for Structurally Sparse Bayesian Neural Networks

Cited by: 0
Authors
Jantre, Sanket [1]
Bhattacharya, Shrijita [2]
Maiti, Tapabrata [2]
Affiliations
[1] Michigan State Univ, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Stat & Probabil, E Lansing, MI USA
Keywords
Bayes methods; Neurons; Biological neural networks; Vectors; Slabs; Predictive models; Linear regression; Computational efficiency; Training; Network topology; Bayesian neural networks (BNNs); posterior consistency; spike-and-slab (SS) priors; structured sparsity; variational inference; POSTERIOR CONSISTENCY; HORSESHOE; MODEL; RELU
DOI
10.1109/TNNLS.2024.3485529
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Network complexity and computational efficiency have become increasingly significant aspects of deep learning. Sparse deep learning addresses these challenges by recovering a sparse representation of the underlying target function by reducing heavily overparameterized deep neural networks. Specifically, deep neural architectures compressed via structured sparsity (e.g., node sparsity) provide low-latency inference, higher data throughput, and reduced energy consumption. In this article, we explore two well-established shrinkage techniques, Lasso and Horseshoe, for model compression in Bayesian neural networks (BNNs). To this end, we propose structurally sparse BNNs, which systematically prune excessive nodes with the following: 1) spike-and-slab group Lasso (SS-GL) and 2) SS group Horseshoe (SS-GHS) priors, and develop computationally tractable variational inference, including continuous relaxation of Bernoulli variables. We establish the contraction rates of the variational posterior of our proposed models as a function of the network topology, layerwise node cardinalities, and bounds on the network weights. We empirically demonstrate the competitive performance of our models compared with the baseline models in prediction accuracy, model compression, and inference latency.
Pages: 13
Related papers
50 in total
  • [21] Spike-and-Slab Priors for Function Selection in Structured Additive Regression Models
    Scheipl, Fabian
    Fahrmeir, Ludwig
    Kneib, Thomas
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (500) : 1518 - 1532
  • [22] Bayesian Lesion Estimation with a Structured Spike-and-Slab Prior
    Menacher, Anna
    Nichols, Thomas E.
    Holmes, Chris
    Ganjgahi, Habib
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (545) : 66 - 80
  • [23] Bayesian Inference for Structured Spike and Slab Priors
    Andersen, Michael Riis
    Winther, Ole
    Hansen, Lars Kai
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [24] The Spike-and-Slab RBM and Extensions to Discrete and Sparse Data Distributions
    Courville, Aaron
    Desjardins, Guillaume
    Bergstra, James
    Bengio, Yoshua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (09) : 1874 - 1887
  • [25] Nonlinear Spike-And-Slab Sparse Coding for Interpretable Image Encoding
    Shelton, Jacquelyn A.
    Sheikh, Abdul-Saboor
    Bornschein, Joerg
    Sterne, Philip
    Luecke, Joerg
    PLOS ONE, 2015, 10 (05):
  • [26] BAYESIAN NONNEGATIVE MATRIX FACTORIZATION WITH A TRUNCATED SPIKE-AND-SLAB PRIOR
    Liu, Yuhang
    Dong, Wenyong
    Song, Wanjuan
    Zhang, Lei
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1450 - 1455
  • [27] Enhancing Nonlinear Subspace Identification Using Sparse Bayesian Learning with Spike and Slab Priors
    Rui Zhu
    Sufang Chen
    Dong Jiang
    Shitao Xie
    Lei Ma
    Stefano Marchesiello
    Dario Anastasio
    Journal of Vibration Engineering & Technologies, 2024, 12 : 3021 - 3031
  • [28] Enhancing Nonlinear Subspace Identification Using Sparse Bayesian Learning with Spike and Slab Priors
    Zhu, Rui
    Chen, Sufang
    Jiang, Dong
    Xie, Shitao
    Ma, Lei
    Marchesiello, Stefano
    Anastasio, Dario
    JOURNAL OF VIBRATION ENGINEERING & TECHNOLOGIES, 2024, 12 (03) : 3021 - 3031
  • [29] Bayesian variable selection using spike-and-slab priors with application to high dimensional electroencephalography data by local modelling
    Mohammed, Shariq
    Dey, Dipak K.
    Zhang, Yuping
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2019, 68 (05) : 1305 - 1326
  • [30] HIERARCHICAL SPARSE MODELING USING SPIKE AND SLAB PRIORS
    Suo, Yuanming
    Minh Dao
    Trac Tran
    Srinivas, Umamahesh
    Monga, Vishal
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3103 - 3107