On robustness of neural ODEs image classifiers

被引：10

作者：

Cui, Wenjun ^{[1
]}

Zhang, Honglei ^{[1
]}

Chu, Haoyu ^{[1
]}

Hu, Pipi ^{[2
]}

Li, Yidong ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China

[2] Microsoft Res AI4Sci, Beijing, Peoples R China

来源：

INFORMATION SCIENCES | 2023年 / 632卷

基金：

中国国家自然科学基金;

关键词：

Neural ODEs; Activation functions; Dynamical behavior; Robustness;

D O I：

10.1016/j.ins.2023.03.049

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Neural Ordinary Differential Equations (Neural ODEs), as a family of novel deep models, delicately link conventional neural networks and dynamical systems, which bridges the gap between theory and practice. However, they have not made substantial progress on activation functions, and ReLU is always utilized by default. Moreover, the dynamical behavior existing in them becomes more unclear and complicated as training progresses. Fortunately, existing studies have shown that activation functions are essential for Neural ODEs in governing intrinsic dynamics. Motivated by a family of weight functions used to enhance the stability of dynamical systems, we introduce a new activation function named half-Swish to match Neural ODEs. Besides, we explore the effect of evolution time and batch size on Neural ODEs, respectively. Experiments show that our model consistently outperforms Neural ODEs with basic activation functions on robustness both against stochastic noise images and adversarial examples across Fashion-MNIST, CIFAR-10, and CIFAR-100 datasets, which strongly validates the applicability of half-Swish and suggests that half-Swish function plays a positive role in regularizing the dynamic behavior to enhance stability. Meanwhile, our work theoretically provides a prospective framework to choose appropriate activation functions to match neural differential equations.

引用

页码：576 / 593

页数：18

共 50 条

[31] Robustness of Compressed Convolutional Neural Networks
Wijayanto, Arie Wahyu
Jin, Choong Jun
Madhawa, Kaushalya
Murata, Tsuyoshi
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4829 - 4836
[32] Integrating Bayesian inference and neural ODEs for microgrids dynamics parameters estimation
Fadoul, Fathi Farah
Caglar, Ramazan
SUSTAINABLE ENERGY GRIDS & NETWORKS, 2024, 39
[33] Realization Theory of Recurrent Neural ODEs using Polynomial System Embeddings
Gonzalez, Martin
Defourneau, Thibault
Hajri, Hatem
Petreczky, Mihaly
SYSTEMS & CONTROL LETTERS, 2023, 173
[34] Tolerate Failures of the Visual Camera With Robust Image Classifiers
Atif, Muhammad
Ceccarelli, Andrea
Zoppi, Tommaso
Bondavalli, Andrea
IEEE ACCESS, 2023, 11 : 5132 - 5143
[35] Reachable sets of classifiers and regression models: (non-)robustness analysis and robust training
Anna-Kathrin Kopetzki
Stephan Günnemann
Machine Learning, 2021, 110 : 1175 - 1197
[36] Multistability and robustness of complex-valued neural networks with delays and input perturbation
Zhang, Fanghai
Huang, Tingwen
Feng, Dan
Zeng, Zhigang
NEUROCOMPUTING, 2021, 447 : 319 - 328
[37] Survey on Robustness Verification of Feedforward Neural Networks and Recurrent Neural Networks
Liu Y.
Yang P.-F.
Zhang L.-J.
Wu Z.-L.
Feng Y.
Ruan Jian Xue Bao/Journal of Software, 2023, 34 (07): : 1 - 33
[38] A Criterion of Robustness Based on Fuzzy Neural Structure
蔡自兴
HighTechnologyLetters, 1999, (01) : 61 - 63
[39] Reachable sets of classifiers and regression models: (non-)robustness analysis and robust training
Kopetzki, Anna-Kathrin
Gunnemann, Stephan
MACHINE LEARNING, 2021, 110 (06) : 1175 - 1197
[40] Robustness of classification ability of spiking neural networks
Jie Yang
Pingping Zhang
Yan Liu
Nonlinear Dynamics, 2015, 82 : 723 - 730

← 1 2 3 4 5 →