Information-theoretic generalization bounds for black-box learning algorithms

被引:0
作者
Harutyunyan, Hrayr [1 ]
Raginsky, Maxim [2 ]
Ver Steeg, Greg [1 ]
Galstyan, Aram [1 ]
机构
[1] USC Informat Sci Inst, Marina Del Rey, CA 90292 USA
[2] Univ Illinois, Champaign, IL USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年 / 34卷
关键词
INFERENCE ATTACKS; STABILITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We derive information-theoretic generalization bounds for supervised learning algorithms based on the information contained in predictions rather than in the output of the training algorithm. These bounds improve over the existing information-theoretic bounds, are applicable to a wider range of algorithms, and solve two key challenges: (a) they give meaningful results for deterministic algorithms and (b) they are significantly easier to estimate. We show experimentally that the proposed bounds closely follow the generalization gap in practical scenarios for deep learning.
引用
收藏
页数:13
相关论文
共 41 条
[1]  
Alabdulmohsin I. M., 2015, Advances in Neural Information Processing Systems, P19
[2]   Towards a Unified Theory of Learning and Information [J].
Alabdulmohsin, Ibrahim .
ENTROPY, 2020, 22 (04)
[3]  
[Anonymous], 2019, Advances in Neural Information Processing Systems
[4]  
[Anonymous], 2018, PMLR
[5]  
Bassily R., 2018, ALGORITHMIC LEARNING, P25
[6]   Algorithmic Stability for Adaptive Data Analysis [J].
Bassily, Raef ;
Nissim, Kobbi ;
Smith, Adam ;
Steinke, Thomas ;
Stemmer, Uri ;
Ullman, Jonathan .
STOC'16: PROCEEDINGS OF THE 48TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2016, :1046-1059
[7]   Stability and generalization [J].
Bousquet, O ;
Elisseeff, A .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (03) :499-526
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]  
Esposito Amedeo Roberto, 2021, IEEE Transactions on Information Theory
[10]   RECURSIVE STOCHASTIC ALGORITHMS FOR GLOBAL OPTIMIZATION IN RD [J].
GELFAND, SB ;
MITTER, SK .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1991, 29 (05) :999-1018