Optimization and Bayes: A Trade-off for Overparameterized Neural Networks

被引:0
作者
Hu, Zhengmian [1 ]
Huang, Heng [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20740 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel algorithm, Transformative Bayesian Learning (TransBL), which bridges the gap between empirical risk minimization (ERM) and Bayesian learning for neural networks. We compare ERM, which uses gradient descent to optimize, and Bayesian learning with importance sampling for their generalization and computational complexity. We derive the first algorithm-dependent PAC-Bayesian generalization bound for infinitely wide networks based on an exact KL divergence between the trained posterior distribution obtained by infinitesimal step size gradient descent and a Gaussian prior. Moreover, we show how to transform gradient-based optimization into importance sampling by incorporating a weight. While Bayesian learning has better generalization, it suffers from low sampling efficiency. Optimization methods, on the other hand, have good sampling efficiency but poor generalization. Our proposed algorithm TransBL enables a trade-off between generalization and sampling efficiency.
引用
收藏
页数:26
相关论文
共 50 条
  • [11] Understanding the Energy vs. Adversarial Robustness Trade-Off in Deep Neural Networks
    Lee, Kyungmi
    Chandrakasan, Anantha P.
    IEEE OPEN JOURNAL OF CIRCUITS AND SYSTEMS, 2021, 2 : 843 - 855
  • [12] Triangular Trade-off between Robustness, Accuracy, and Fairness in Deep Neural Networks: A Survey
    Li, Jingyang
    Li, Guoqiang
    ACM COMPUTING SURVEYS, 2025, 57 (06)
  • [13] Rate-Accuracy Trade-Off in Video Classification With Deep Convolutional Neural Networks
    Jubran, Mohammad
    Abbas, Alhabib
    Chadha, Aaron
    Andreopoulos, Yiannis
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (01) : 145 - 154
  • [14] Trade-off between gradient measurement efficiency and expressivity in deep quantum neural networks
    Koki Chinzei
    Shinichiro Yamano
    Quoc Hoan Tran
    Yasuhiro Endo
    Hirotaka Oshima
    npj Quantum Information, 11 (1)
  • [15] RATE-ACCURACY TRADE-OFF IN VIDEO CLASSIFICATION WITH DEEP CONVOLUTIONAL NEURAL NETWORKS
    Abbas, Alhabib
    Jubran, Mohammad
    Chadha, Aaron
    Andreopoulos, Yiannis
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 793 - 797
  • [16] NO TRADE-OFF
    NICOLINI, M
    NATION, 1977, 224 (20) : 610 - 610
  • [17] TRADE-OFF
    MANKIW, NG
    NEW REPUBLIC, 1991, 204 (13) : 4 - 4
  • [18] Online convex optimization in wireless networks and beyond: The feedback-performance trade-off
    Belmega, E. Veronica
    Mertikopoulos, Panayotis
    Negrel, Romain
    2022 20TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2022), 2022, : 298 - 305
  • [19] Throughput-Delay Trade-Off for Cognitive Radio Networks: A Convex Optimization Perspective
    Hu, Hang
    Zhang, Hang
    Yu, Hong
    ABSTRACT AND APPLIED ANALYSIS, 2014,
  • [20] Uncertainty trade-off and disturbance trade-off for quantum measurements
    Srinivas, M. D.
    Mandayam, Prabha
    CURRENT SCIENCE, 2015, 109 (11): : 2044 - 2051