ADVERSARIAL TRAINING WITH STOCHASTIC WEIGHT AVERAGE

被引:6
作者
Hwang, Joong-won [1 ]
Lee, Youngwan [1 ]
Oh, Sungchan [1 ]
Bae, Yuseok [1 ]
机构
[1] Elect & Telecommun Res Inst, Daejeon, South Korea
来源
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021年
关键词
Adversarial training; Stochastic Weight Average; Ensemble; Robustness; Hard Example Mining;
D O I
10.1109/ICIP42928.2021.9506548
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although adversarial training is the most reliable method to train robust deep neural networks so far, adversarially trained networks still show large gap between their accuracies on clean images and those on adversarial images. In conventional classification problem, one can gain higher accuracy by ensembling multiple networks. However, in adversarial training, there are obstacles to adopt such ensemble method. First, as inner maximization is expensive, training multiple networks adversarially becomes overburden. Moreover, the naive ensemble faces dilemma on choosing target model to generate adversarial examples with. Training adversarial examples of the members causes covariate shift, while training those of ensemble diminishes the benefit of ensembling. With these insights, we adopt stochastic weight average methods and improve it by considering overfitting nature of adversarial training. Our method take the benefit of ensemble while avoiding the described problems. Experiments on CIFAR10 and CIFAR100 shows our method improves the robustness effectively.
引用
收藏
页码:814 / 818
页数:5
相关论文
共 20 条
[1]  
[Anonymous], 2016, P BRIT MACH VIS C BM
[2]  
Athalye A, 2018, PR MACH LEARN RES, V80
[3]  
Garipov T, 2018, ADV NEUR IN, V31
[4]  
Goodfellow IJ, 2015, 3 INT C LEARN REPR I
[5]  
Grefenstette Edward, 2018, ARXIV PREPRINT ARXIV
[6]   Identity Mappings in Deep Residual Networks [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :630-645
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]  
Huang G., 2017, SNAPSHOT ENSEMBLES T
[9]  
Ilyas A, 2019, ADV NEUR IN, V32
[10]  
Izmailov P, 2018, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, P876