Lipschitz Certificates for Layered Network Structures Driven by Averaged Activation Operators

被引:49
作者
Combettes, Patrick L. [1 ]
Pesquet, Jean-Christophe [2 ]
机构
[1] North Carolina State Univ, Dept Math, Raleigh, NC 27695 USA
[2] Univ Paris Saclay, OPIS Inria Project Team, Ctr Visual Comp, Cent Supelec, F-91190 Gif Sur Yvette, France
来源
SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE | 2020年 / 2卷 / 02期
基金
美国国家科学基金会;
关键词
activation function; neural network; nonexpansive operator; averaged operator; stability; layered network; CONVERGENCE; ALGORITHM; MODEL;
D O I
10.1137/19M1272780
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Obtaining sharp Lipschitz constants for feed-forward neural networks is essential to assess their robustness in the face of perturbations of their inputs. We derive such constants in the context of a general layered network model involving compositions of nonexpansive averaged operators and affine operators. By exploiting this architecture, our analysis finely captures the interactions between the layers, yielding tighter Lipschitz constants than those resulting from the product of individual bounds for groups of layers. The proposed framework is shown to cover in particular many practical instances encountered in feed-forward neural networks. Our Lipschitz constant estimates are further improved in the case of structures employing scalar nonlinear functions, which include standard convolutional networks as special cases.
引用
收藏
页码:529 / 557
页数:29
相关论文
共 58 条
[1]   Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey [J].
Akhtar, Naveed ;
Mian, Ajmal .
IEEE ACCESS, 2018, 6 :14410-14430
[2]   Robust multilayer neural network based on median neuron model [J].
Aladag, Cagdas Hakan ;
Egrioglu, Erol ;
Yolcu, Ufuk .
NEURAL COMPUTING & APPLICATIONS, 2014, 24 (3-4) :945-956
[3]  
[Anonymous], 2013, P 30 INT C MACH LEAR
[4]  
[Anonymous], 2017, PREPRINT
[5]  
[Anonymous], 2010, CONVOLUTIONAL DEEP B
[6]  
[Anonymous], P 14 INT C ART INT S
[7]  
Athalye A, 2018, PR MACH LEARN RES, V80
[8]  
Baillon J. B., 1978, Houston Journal of Mathematics, V4, P1
[9]  
Balan R., 2017, PREPRINT
[10]  
Bartlett P., 2017, ADV NEUR IN, P6240