Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks

被引:9
|
作者
Prach, Bernd [1 ]
Lampert, Christoph H. [1 ]
机构
[1] Inst Sci & Technol Austria ISTA, Klosterneuburg, Austria
来源
关键词
Lipschitz networks; Orthogonality; Robustness;
D O I
10.1007/978-3-031-19803-8_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is a highly desirable property for deep networks to be robust against small input changes. One popular way to achieve this property is by designing networks with a small Lipschitz constant. In this work, we propose a new technique for constructing such Lipschitz networks that has a number of desirable properties: it can be applied to any linear network layer (fully-connected or convolutional), it provides formal guarantees on the Lipschitz constant, it is easy to implement and efficient to run, and it can be combined with any training objective and optimization method. In fact, our technique is the first one in the literature that achieves all of these properties simultaneously. Our main contribution is a rescaling-based weight matrix parametrization that guarantees each network layer to have a Lipschitz constant of at most 1 and results in the learned weight matrices to be close to orthogonal. Hence we call such layers almost-orthogonal Lipschitz (AOL). Experiments and ablation studies in the context of image classification with certified robust accuracy confirm that AOL layers achieve results that are on par with most existing methods. Yet, they are simpler to implement and more broadly applicable, because they do not require computationally expensive matrix orthogonalization or inversion steps as part of the network architecture. We provide code at https://github.com/berndprach/AOL.
引用
收藏
页码:350 / 365
页数:16
相关论文
共 50 条
  • [1] Towards Efficient Processing of General-Purpose Joins in Sensor Networks
    Stern, Mirco
    Buchmann, Erik
    Boehm, Klemens
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 126 - 137
  • [2] AN EFFICIENT GENERAL-PURPOSE PARALLEL COMPUTER
    GALIL, Z
    PAUL, WJ
    JOURNAL OF THE ACM, 1983, 30 (02) : 360 - 387
  • [3] General-purpose compression for efficient retrieval
    Cannane, A
    Williams, HE
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2001, 52 (05): : 430 - 437
  • [4] EFFICIENT GENERAL-PURPOSE PARALLEL COMPUTER.
    Galil, Zvi
    Paul, Wolfang J.
    Journal of the ACM, 1983, 30 (02): : 360 - 387
  • [5] Efficient Utilization of SIMD Engines for General-Purpose Processors
    Huang, Libo
    Wang, Zhiying
    Xiao, Nong
    Dou, Qiang
    COMPUTER JOURNAL, 2014, 57 (08): : 1141 - 1154
  • [6] General-Purpose FPGA Platform for Efficient Encryption and Hashing
    Szefer, Jakub
    Chen, Yu-Yuan
    Lee, Ruby B.
    21ST IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, 2010,
  • [7] Runtime reconfiguration techniques for efficient general-purpose computation
    Xu, BX
    Albonesi, DH
    IEEE DESIGN & TEST OF COMPUTERS, 2000, 17 (01): : 42 - 52
  • [8] An Efficient Formulation for General-Purpose Multibody/Multiphysics Analysis
    Masarati, Pierangelo
    Morandini, Marco
    Mantegazza, Paolo
    JOURNAL OF COMPUTATIONAL AND NONLINEAR DYNAMICS, 2014, 9 (04):
  • [10] A general-purpose communication networks simulation system - GPCNSS
    Cao, Y.
    Zhang, W.
    Sha, J.
    Xu, L.
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2001, 13 (03): : 353 - 356