Stylized Adversarial Defense

Cited by: 3
Authors
Naseer, Muzammal [1, 2]
Khan, Salman [1, 2]
Hayat, Munawar [3]
Khan, Fahad Shahbaz [1, 4, 5]
Porikli, Fatih [6]
Affiliations
[1] Mohamed bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[2] Australian Natl Univ, Canberra, ACT 2601, Australia
[3] Monash Univ, Clayton, Vic 3800, Australia
[4] Mohamed bin Zayed Univ Artificial Intelligence, Masdar, Abu Dhabi, U Arab Emirates
[5] Linkoping Univ, S-58183 Linkoping, Sweden
[6] Qualcomm, San Diego, CA 92121 USA
Keywords
Training; Perturbation methods; Robustness; Multitasking; Predictive models; Computational modeling; Visualization; Adversarial training; style transfer; max-margin learning; adversarial attacks; multi-task objective;
DOI
10.1109/TPAMI.2022.3207917
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep Convolutional Neural Networks (CNNs) can easily be fooled by subtle, imperceptible changes to the input images. To address this vulnerability, adversarial training creates perturbation patterns and includes them in the training set to robustify the model. In contrast to existing adversarial training methods that only use class-boundary information (e.g., using a cross-entropy loss), we propose to exploit additional information from the feature space to craft stronger adversaries that are in turn used to learn a robust model. Specifically, we use the style and content information of the target sample from another class, alongside its class-boundary information, to create adversarial perturbations. We apply our proposed multi-task objective in a deeply supervised manner, extracting multi-scale feature knowledge to create maximally separating adversaries. Subsequently, we propose a max-margin adversarial training approach that minimizes the distance between the source image and its adversary and maximizes the distance between the adversary and the target image. Our adversarial training approach demonstrates strong robustness compared to state-of-the-art defenses, generalizes well to naturally occurring corruptions and data distributional shifts, and retains the model's accuracy on clean examples.
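The max-margin idea in the abstract can be illustrated with a hinge-style triplet loss. The following is only a minimal sketch under stated assumptions, not the authors' implementation: the function names `l2_distance` and `max_margin_loss`, the use of Euclidean distance on plain feature vectors, and the `margin` parameter are all hypothetical choices made for illustration. The abstract does not specify the distance measure or the feature space used.

```python
import math


def l2_distance(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def max_margin_loss(adversary, source, target, margin=1.0):
    """Hinge-style triplet loss (illustrative assumption, not the paper's exact loss).

    The loss shrinks as the adversary's features move closer to the source
    image's features and farther from the target image's features. It is
    zero once the target is at least `margin` farther away than the source.
    """
    d_source = l2_distance(adversary, source)  # distance to be minimized
    d_target = l2_distance(adversary, target)  # distance to be maximized
    return max(0.0, d_source - d_target + margin)


# Hypothetical 2-D feature vectors, for illustration only.
adv_feat = [0.0, 0.0]
src_feat = [0.0, 1.0]
tgt_feat = [3.0, 4.0]
loss = max_margin_loss(adv_feat, src_feat, tgt_feat, margin=1.0)
```

In this toy case the adversary is already much closer to the source than to the target, so the margin is satisfied and the loss is zero. Swapping the source and target vectors gives a positive loss, which is the situation the training objective would penalize.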
Pages: 6403-6414
Number of pages: 12
Related papers
50 in total
  • [31] ADVERSARIAL DEFENSE FOR DEEP SPEAKER RECOGNITION USING HYBRID ADVERSARIAL TRAINING
    Pal, Monisankha
    Jati, Arindam
    Peri, Raghuveer
    Hsu, Chin-Cheng
    AbdAlmageed, Wael
    Narayanan, Shrikanth
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6164 - 6168
  • [32] FPGA Adaptive Neural Network Quantization for Adversarial Image Attack Defense
    Lu, Yufeng
    Shi, Xiaokang
    Jiang, Jianan
    Deng, Hanhui
    Wang, Yanwen
    Lu, Jiwu
    Wu, Di
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (12) : 14017 - 14028
  • [33] Efficient Defense Against Adversarial Attacks on Multimodal Emotion AI Models
    Cho, Hsin-Hung
    Zeng, Jiang-Yi
    Tsai, Min-Yan
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025,
  • [34] Investigating the Factors Impacting Adversarial Attack and Defense Performances in Federated Learning
    Aljaafari, Nura
    Nazzal, Mahmoud
    Sawalmeh, Ahmad H.
    Khreishah, Abdallah
    Anan, Muhammad
    Algosaibi, Abdulelah
    Alnaeem, Mohammed Abdulaziz
    Aldalbahi, Adel
    Alhumam, Abdulaziz
    Vizcarra, Conrado P.
    IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2024, 71 : 12542 - 12555
  • [35] Rethinking Textual Adversarial Defense for Pre-Trained Language Models
    Wang, Jiayi
    Bao, Rongzhou
    Zhang, Zhuosheng
    Zhao, Hai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2526 - 2540
  • [36] Toward Enhanced Adversarial Robustness Generalization in Object Detection: Feature Disentangled Domain Adaptation for Adversarial Training
    Jung, Yoojin
    Song, Byung Cheol
    IEEE ACCESS, 2024, 12 : 179065 - 179076
  • [37] Curriculum Defense: An Effective Adversarial Training Method
    Yin, Huilin
    Deng, Xiaoyang
    Yan, Jun
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7399 - 7406
  • [38] Stylized Crowd Formation Transformation Through Spatiotemporal Adversarial Learning
    Yan, Dapeng
    Huang, Kexiang
    Zhang, Longfei
    Ding, Gang Yi
    ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (03)
  • [39] An Adversarial Example Defense Algorithm for Intelligent Driving
    Lu, Jiazhong
    Wang, Chenli
    Huang, Yuanyuan
    Ding, Kangyi
    Liu, Xiaolei
    IEEE NETWORK, 2024, 38 (06) : 98 - 105
  • [40] Towards Generating Stylized Image Captions via Adversarial Training
    Nezami, Omid Mohamad
    Dras, Mark
    Wan, Stephen
    Paris, Cecile
    Hamey, Len
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2019, 11670 : 270 - 284