Spike-and-Slab Shrinkage Priors for Structurally Sparse Bayesian Neural Networks

Cited by: 0
Authors
Jantre, Sanket [1]
Bhattacharya, Shrijita [2]
Maiti, Tapabrata [2]
Affiliations
[1] Michigan State Univ, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Stat & Probabil, E Lansing, MI USA
Keywords
Bayes methods; Neurons; Biological neural networks; Vectors; Slabs; Predictive models; Linear regression; Computational efficiency; Training; Network topology; Bayesian neural networks (BNNs); posterior consistency; spike-and-slab (SS) priors; structured sparsity; variational inference; POSTERIOR CONSISTENCY; HORSESHOE; MODEL; RELU
DOI
10.1109/TNNLS.2024.3485529
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Network complexity and computational efficiency have become increasingly significant aspects of deep learning. Sparse deep learning addresses these challenges by recovering a sparse representation of the underlying target function by reducing heavily overparameterized deep neural networks. Specifically, deep neural architectures compressed via structured sparsity (e.g., node sparsity) provide low-latency inference, higher data throughput, and reduced energy consumption. In this article, we explore two well-established shrinkage techniques, Lasso and Horseshoe, for model compression in Bayesian neural networks (BNNs). To this end, we propose structurally sparse BNNs, which systematically prune excessive nodes with the following: 1) spike-and-slab group Lasso (SS-GL) and 2) SS group Horseshoe (SS-GHS) priors, and develop computationally tractable variational inference, including continuous relaxation of Bernoulli variables. We establish the contraction rates of the variational posterior of our proposed models as a function of the network topology, layerwise node cardinalities, and bounds on the network weights. We empirically demonstrate the competitive performance of our models compared with the baseline models in prediction accuracy, model compression, and inference latency.
Pages: 13
Related papers
  • [1] Bayesian Change Point Detection with Spike-and-Slab Priors
    Cappello, Lorenzo
    Padilla, Oscar Hernan Madrid
    Palacios, Julia A.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (04) : 1488 - 1500
  • [2] Learning sparse deep neural networks with a spike-and-slab prior
    Sun, Yan
    Song, Qifan
    Liang, Faming
    STATISTICS & PROBABILITY LETTERS, 2022, 180
  • [3] Equation Discovery with Bayesian Spike-and-Slab Priors and Efficient Kernels
    Long, Da
    Xing, Wei
    Krishnapriyan, Aditi S.
    Kirby, Robert M.
    Zhe, Shandian
    Mahoney, Michael W.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [4] Bayesian Inference for Spatio-temporal Spike-and-Slab Priors
    Andersen, Michael Riis
    Vehtari, Aki
    Winther, Ole
    Hansen, Lars Kai
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [5] Negotiating multicollinearity with spike-and-slab priors
    Ročková, V.
    George, E. I.
    METRON, 2014, 72 (2) : 217 - 229
  • [6] On spike-and-slab priors for Bayesian equation discovery of nonlinear dynamical systems via sparse linear regression
    Nayek, R.
    Fuentes, R.
    Worden, K.
    Cross, E. J.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2021, 161
  • [7] Variable Fusion for Bayesian Linear Regression via Spike-and-slab Priors
    Wu, Shengyi
    Shimamura, Kaito
    Yoshikawa, Kohei
    Murayama, Kazuaki
    Kawano, Shuichi
    INTELLIGENT DECISION TECHNOLOGIES, KES-IDT 2021, 2021, 238 : 491 - 501
  • [8] BAYESIAN ESTIMATION OF SPARSE SIGNALS WITH A CONTINUOUS SPIKE-AND-SLAB PRIOR
    Rockova, Veronika
    ANNALS OF STATISTICS, 2018, 46 (01) : 401 - 437
  • [9] Online Bayesian Sparse Learning with Spike and Slab Priors
    Fang, Shikai
    Zhe, Shandian
    Lee, Kuang-chih
    Zhang, Kai
    Neville, Jennifer
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020 : 142 - 151
  • [10] A sparse Bayesian approach to model structure selection and parameter estimation of dynamical systems using spike-and-slab priors
    Nayek, R.
    Worden, K.
    Cross, E. J.
    Fuentes, R.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NOISE AND VIBRATION ENGINEERING (ISMA2020) / INTERNATIONAL CONFERENCE ON UNCERTAINTY IN STRUCTURAL DYNAMICS (USD2020), 2020 : 3639 - 3653