On robustness of neural ODEs image classifiers

被引:10
|
作者
Cui, Wenjun [1 ]
Zhang, Honglei [1 ]
Chu, Haoyu [1 ]
Hu, Pipi [2 ]
Li, Yidong [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Microsoft Res AI4Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural ODEs; Activation functions; Dynamical behavior; Robustness;
D O I
10.1016/j.ins.2023.03.049
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural Ordinary Differential Equations (Neural ODEs), as a family of novel deep models, delicately link conventional neural networks and dynamical systems, which bridges the gap between theory and practice. However, they have not made substantial progress on activation functions, and ReLU is always utilized by default. Moreover, the dynamical behavior existing in them becomes more unclear and complicated as training progresses. Fortunately, existing studies have shown that activation functions are essential for Neural ODEs in governing intrinsic dynamics. Motivated by a family of weight functions used to enhance the stability of dynamical systems, we introduce a new activation function named half-Swish to match Neural ODEs. Besides, we explore the effect of evolution time and batch size on Neural ODEs, respectively. Experiments show that our model consistently outperforms Neural ODEs with basic activation functions on robustness both against stochastic noise images and adversarial examples across Fashion-MNIST, CIFAR-10, and CIFAR-100 datasets, which strongly validates the applicability of half-Swish and suggests that half-Swish function plays a positive role in regularizing the dynamic behavior to enhance stability. Meanwhile, our work theoretically provides a prospective framework to choose appropriate activation functions to match neural differential equations.
引用
收藏
页码:576 / 593
页数:18
相关论文
共 50 条
  • [31] Robustness of Compressed Convolutional Neural Networks
    Wijayanto, Arie Wahyu
    Jin, Choong Jun
    Madhawa, Kaushalya
    Murata, Tsuyoshi
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4829 - 4836
  • [32] Integrating Bayesian inference and neural ODEs for microgrids dynamics parameters estimation
    Fadoul, Fathi Farah
    Caglar, Ramazan
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2024, 39
  • [33] Realization Theory of Recurrent Neural ODEs using Polynomial System Embeddings
    Gonzalez, Martin
    Defourneau, Thibault
    Hajri, Hatem
    Petreczky, Mihaly
    SYSTEMS & CONTROL LETTERS, 2023, 173
  • [34] Tolerate Failures of the Visual Camera With Robust Image Classifiers
    Atif, Muhammad
    Ceccarelli, Andrea
    Zoppi, Tommaso
    Bondavalli, Andrea
    IEEE ACCESS, 2023, 11 : 5132 - 5143
  • [35] Reachable sets of classifiers and regression models: (non-)robustness analysis and robust training
    Anna-Kathrin Kopetzki
    Stephan Günnemann
    Machine Learning, 2021, 110 : 1175 - 1197
  • [36] Multistability and robustness of complex-valued neural networks with delays and input perturbation
    Zhang, Fanghai
    Huang, Tingwen
    Feng, Dan
    Zeng, Zhigang
    NEUROCOMPUTING, 2021, 447 : 319 - 328
  • [37] Survey on Robustness Verification of Feedforward Neural Networks and Recurrent Neural Networks
    Liu Y.
    Yang P.-F.
    Zhang L.-J.
    Wu Z.-L.
    Feng Y.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (07): : 1 - 33
  • [38] A Criterion of Robustness Based on Fuzzy Neural Structure
    蔡自兴
    HighTechnologyLetters, 1999, (01) : 61 - 63
  • [39] Reachable sets of classifiers and regression models: (non-)robustness analysis and robust training
    Kopetzki, Anna-Kathrin
    Gunnemann, Stephan
    MACHINE LEARNING, 2021, 110 (06) : 1175 - 1197
  • [40] Robustness of classification ability of spiking neural networks
    Jie Yang
    Pingping Zhang
    Yan Liu
    Nonlinear Dynamics, 2015, 82 : 723 - 730