Certified defense against patch attacks via mask-guided randomized smoothing

被引:0
|
作者
Kui Zhang
Hang Zhou
Huanyu Bian
Weiming Zhang
Nenghai Yu
机构
[1] University of Science and Technology of China,School of Cyber Science and Technology
[2] Simon Fraser University,School of Computer Science
来源
Science China Information Sciences | 2022年 / 65卷
关键词
certified defense; adversarial patch; patch localization; randomized smoothing; joint voting;
D O I
暂无
中图分类号
学科分类号
摘要
The adversarial patch is a practical and effective method that modifies a small region on an image, making DNNs fail to classify. Existing empirical defenses against adversarial patch attacks lack theoretical analysis and are vulnerable to adaptive attacks. To overcome such shortcomings, certified defenses that provide a guaranteed classification performance in the face of strong unknown adversarial attacks are proposed. However, on the one hand, existing certified defenses either have low clean accuracy or need specified architecture, which is not robust enough. On the other hand, they can only provide provable accuracy but ignore the relationship to the number of perturbations. In this paper, we propose a certified defense against patch attacks that provides both the provable radius and high classification accuracy. By adding Gaussian noises only on the patch region with a mask, we prove that a stronger certificate with high confidence can be achieved by randomized smoothing. Furthermore, we design a practical scheme based on joint voting to find the patch with a high probability and certify it effectively. Our defense achieves 86.4% clean accuracy and 71.8% certified accuracy on CIFAR-10 exceeding the maximum 60% certified accuracy of existing methods. The clean accuracy of 67.8% and the certified accuracy of 53.6% on ImageNet are better than the state-of-the-art method, whose certified accuracy is 26%.
引用
收藏
相关论文
共 50 条
  • [1] Certified defense against patch attacks via mask-guided randomized smoothing
    Kui ZHANG
    Hang ZHOU
    Huanyu BIAN
    Weiming ZHANG
    Nenghai YU
    Science China(Information Sciences), 2022, 65 (07) : 86 - 97
  • [2] Certified defense against patch attacks via mask-guided randomized smoothing
    Zhang, Kui
    Zhou, Hang
    Bian, Huanyu
    Zhang, Weiming
    Yu, Nenghai
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (07)
  • [3] Towards Robust Certified Defense via Improved Randomized Smoothing
    Kumar, Satyadwyoom
    Narayan, Apurva
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] Mask-guided noise restriction adversarial attacks for image classification
    Duan, Yexin
    Zhou, Xingyu
    Zou, Junhua
    Qiu, Junyang
    Zhang, Jin
    Pan, Zhisong
    COMPUTERS & SECURITY, 2021, 100
  • [5] Certified Robustness to Text Adversarial Attacks by Randomized [MASK]
    Zeng, Jiehang
    Xu, Jianhan
    Zheng, Xiaoqing
    Huang, Xuanjing
    COMPUTATIONAL LINGUISTICS, 2023, 49 (02) : 395 - 427
  • [6] GSmooth: Certified Robustness against Semantic Transformations via Generalized Randomized Smoothing
    Hao, Zhongkai
    Ying, Chengyang
    Dong, Yinpeng
    Su, Hang
    Song, Jian
    Zhu, Jun
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [7] Scalable Certified Segmentation via Randomized Smoothing
    Fischer, Marc
    Baader, Maximilian
    Vechev, Martin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [8] Certified Adversarial Robustness via Randomized Smoothing
    Cohen, Jeremy
    Rosenfeld, Elan
    Kolter, J. Zico
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [9] Wasserstein Smoothing: Certified Robustness against Wasserstein Adversarial Attacks
    Levine, Alexander
    Feizi, Soheil
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 3938 - 3946
  • [10] Co-saliency Detection via Mask-guided Fully Convolutional Networks with Multi-scale Label Smoothing
    Zhang, Kaihua
    Li, Tengpeng
    Liu, Bo
    Liu, Qingshan
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3090 - 3099