Regularisation of neural networks by enforcing Lipschitz continuity

被引:180
作者
Gouk, Henry [1 ]
Frank, Eibe [2 ]
Pfahringer, Bernhard [2 ]
Cree, Michael J. [2 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[2] Univ Waikato, Hamilton, New Zealand
关键词
Neural networks; Regularisation; Lipschitz continuity;
D O I
10.1007/s10994-020-05929-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate the effect of explicitly enforcing the Lipschitz continuity of neural networks with respect to their inputs. To this end, we provide a simple technique for computing an upper bound to the Lipschitz constant-for multiple p-norms-of a feed forward neural network composed of commonly used layer types. Our technique is then used to formulate training a neural network with a bounded Lipschitz constant as a constrained optimisation problem that can be solved using projected stochastic gradient methods. Our evaluation study shows that the performance of the resulting models exceeds that of models trained with other common regularisers. We also provide evidence that the hyperparameters are intuitive to tune, demonstrate how the choice of norm for computing the Lipschitz constant impacts the resulting model, and show that the performance gains provided by our method are particularly noticeable when only a small amount of training data is available.
引用
收藏
页码:393 / 416
页数:24
相关论文
共 50 条
[21]   CUTTING CORNERS PRESERVES LIPSCHITZ CONTINUITY [J].
FENG YUYUDeptof MathUSTC HefeiPRChinaJERNEJ KOZAKDeptof Math University of LjubljanaSlovenija .
AppliedMathematics:AJournalofChineseUniversities(SeriesB), 1994, (01) :31-34
[22]   Local Lipschitz continuity of the stop operator [J].
Desch W. .
Applications of Mathematics, 1998, 43 (6) :461-477
[23]   LIPSCHITZ CONTINUITY OF INNER OUTER FACTORIZATION [J].
WANG, LY .
SYSTEMS & CONTROL LETTERS, 1991, 16 (04) :281-287
[24]   Lipschitz continuity of inf-projections [J].
Wets, RJB .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2003, 25 (1-3) :269-282
[25]   Lipschitz continuity of polyhedral Skorokhod maps [J].
Krejci, P ;
Vladimirov, A .
ZEITSCHRIFT FUR ANALYSIS UND IHRE ANWENDUNGEN, 2001, 20 (04) :817-844
[26]   Lipschitz continuity for solutions of the ?-Poisson equation [J].
Xingdi Chen .
Science China(Mathematics), 2019, 62 (10) :1935-1946
[27]   Lipschitz continuity for isotropic matrix functions [J].
Carlsson, Marcus .
LINEAR ALGEBRA AND ITS APPLICATIONS, 2021, 624 :259-266
[28]   A weight perturbation-based regularisation technique for convolutional neural networks and the application in medical imaging [J].
Khatami, Amin ;
Nazari, Asef ;
Khosravi, Abbas ;
Lim, Chee Peng ;
Nahavandi, Saeid .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 149
[29]   Jacobian Norm Regularisation and Conditioning in Neural ODEs [J].
Josias, Shane ;
Brink, Willie .
ARTIFICIAL INTELLIGENCE RESEARCH, SACAIR 2022, 2022, 1734 :31-45
[30]   On lengths, areas and Lipschitz continuity of polyharmonic mappings [J].
Chen, J. ;
Rasila, A. ;
Wang, X. .
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2015, 422 (02) :1196-1212