Spike-and-Slab Shrinkage Priors for Structurally Sparse Bayesian Neural Networks

被引:0
|
作者
Jantre, Sanket [1 ]
Bhattacharya, Shrijita [2 ]
Maiti, Tapabrata [2 ]
机构
[1] Michigan State Univ, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Stat & Probabil, E Lansing, MI USA
关键词
Bayes methods; Neurons; Biological neural networks; Vectors; Slabs; Predictive models; Linear regression; Computational efficiency; Training; Network topology; Bayesian neural networks (BNNs); posterior consistency; spike-and-slab (SS) priors; structured sparsity; variational inference; POSTERIOR CONSISTENCY; HORSESHOE; MODEL; RELU;
D O I
10.1109/TNNLS.2024.3485529
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Network complexity and computational efficiency have become increasingly significant aspects of deep learning. Sparse deep learning addresses these challenges by recovering a sparse representation of the underlying target function by reducing heavily overparameterized deep neural networks. Specifically, deep neural architectures compressed via structured sparsity (e.g., node sparsity) provide low-latency inference, higher data throughput, and reduced energy consumption. In this article, we explore two well-established shrinkage techniques, Lasso and Horseshoe, for model compression in Bayesian neural networks (BNNs). To this end, we propose structurally sparse BNNs, which systematically prune excessive nodes with the following: 1) spike-and-slab group Lasso (SS-GL) and 2) SS group Horseshoe (SS-GHS) priors, and develop computationally tractable variational inference, including continuous relaxation of Bernoulli variables. We establish the contraction rates of the variational posterior of our proposed models as a function of the network topology, layerwise node cardinalities, and bounds on the network weights. We empirically demonstrate the competitive performance of our models compared with the baseline models in prediction accuracy, model compression, and inference latency.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Bayesian sparse convex clustering via global-local shrinkage priors
    Kaito Shimamura
    Shuichi Kawano
    Computational Statistics, 2021, 36 : 2671 - 2699
  • [42] Bayesian sparse convex clustering via global-local shrinkage priors
    Shimamura, Kaito
    Kawano, Shuichi
    COMPUTATIONAL STATISTICS, 2021, 36 (04) : 2671 - 2699
  • [43] Generalized cumulative shrinkage process priors with applications to sparse Bayesian factor analysis
    Fruehwirth-Schnatter, Sylvia
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2023, 381 (2247):
  • [44] Which UGC features drive web purchase intent? A spike-and-slab Bayesian Variable Selection Approach
    Owusu, Richard A.
    Mutshinda, Crispin M.
    Antai, Imoh
    Dadzie, Kofi Q.
    Winston, Evelyn M.
    INTERNET RESEARCH, 2016, 26 (01) : 22 - 37
  • [45] Bayesian Image-on-Scalar Regression with a Spatial Global-Lo cal Spike-and-Slab Prior
    Zeng, Zijian
    Li, Meng
    Vannucci, Marina
    BAYESIAN ANALYSIS, 2024, 19 (01): : 235 - 260
  • [46] BAYESIAN NEURAL NETWORKS FOR SPARSE CODING
    Kuzin, Danil
    Isupova, Olga
    Mihaylova, Lyudmila
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2992 - 2996
  • [47] Advantages of Spike and Slab Priors for Detecting Differential Item Functioning Relative to Other Bayesian Regularizing Priors and Frequentist Lasso
    Chen, Siyuan Marco
    Bauer, Daniel J.
    Belzak, William M.
    Brandt, Holger
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2022, 29 (01) : 122 - 139
  • [48] Sparse Bayesian Recurrent Neural Networks
    Chatzis, Sotirios P.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT II, 2015, 9285 : 359 - 372
  • [49] Understanding Priors in Bayesian Neural Networks at the Unit Level
    Vladimirova, Mariia
    Verbeek, Jakob
    Mesejo, Pablo
    Arbel, Julyan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [50] Incorporating spatial structure into inclusion probabilities for Bayesian variable selection in generalized linear models with the spike-and-slab elastic net
    Leach, Justin M.
    Aban, Inmaculada
    Yi, Nengjun
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2022, 217 : 141 - 152