Cube-Evo: A Query-Efficient Black-Box Attack on Video Classification System

被引:2
作者
Zhan, Yu [1 ]
Fu, Ying [2 ]
Huang, Liang [1 ]
Guo, Jianmin [3 ]
Shi, Heyuan [1 ]
Song, Houbing [4 ]
Hu, Chao [1 ]
机构
[1] Cent South Univ, Changsha 410017, Peoples R China
[2] Natl Univ Def Technol, Changsha 410073, Peoples R China
[3] Tsinghua Univ, Beijing 100190, Peoples R China
[4] Univ Maryland Baltimore Cty, Baltimore, MD 21250 USA
基金
国家重点研发计划; 中国国家自然科学基金; 湖南省自然科学基金;
关键词
Perturbation methods; Pipelines; Closed box; Costs; Sociology; Estimation; Security; Adversarial examples; black-box attack; deep learning; system testing; video classification;
D O I
10.1109/TR.2023.3261986
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The current progressive research in the domain of black-box adversarial attack enhances the reliability of deep neural network (DNN)-based video systems. Recent works mainly carry out black-box adversarial attacks on video systems by query-based parameter dimension reduction. However, the additional temporal dimension of video data leads to massive query consumption and low attack success rate. In this article, we embark on our efforts to design an effective adversarial attack on popular video classification systems. We deeply root the observations that the DNN-based systems are sensitive to adversarial perturbations with high frequency and reconstructed shape. Specifically, we propose a systematic attack pipeline Cube-Evo, aiming to reduce the search space dimension and obtain the effective adversarial perturbation via the optimal parameter group updating. We evaluate the proposed attack pipeline on two popular datasets: UCF101 and JESTER. Our attack pipeline reduces query consumption and achieves a high success rate on various DNN-based video classification systems. Compared with the state-of-the-art method Geo-Trap-Att, our pipeline averagely reduces 1.6x query consumption in untargeted attacks and 2.9x in targeted attacks. Besides, Cube-Evo improves 13% attack success rate on average, achieving new state-of-the-art results over diverse video classification systems.
引用
收藏
页码:1160 / 1171
页数:12
相关论文
共 50 条
  • [1] Al-Dujaili A., 2020, PROC INT C LEARN REP
  • [2] Andriushchenko Maksym, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12368), P484, DOI 10.1007/978-3-030-58592-1_29
  • [3] Practical Black-Box Attacks on Deep Neural Networks Using Efficient Query Mechanisms
    Bhagoji, Arjun Nitin
    He, Warren
    Li, Bo
    Song, Dawn
    [J]. COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 158 - 174
  • [4] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [5] Chambolle A, 2004, J MATH IMAGING VIS, V20, P89
  • [6] Chen P., 2017, AISEC 17 P 10 ACM WO, P15, DOI [DOI 10.1145/3128572.3140448, 10.1145/3128572.3140448]
  • [7] Chen PY, 2018, AAAI CONF ARTIF INTE, P10
  • [8] Appending Adversarial Frames for Universal Video Attack
    Chen, Zhikai
    Xie, Lingxi
    Pang, Shanmin
    He, Yong
    Tian, Qi
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3198 - 3207
  • [9] Cheng S., 2019, PROC INT C NEURAL IN, P10932
  • [10] Croce F, 2022, AAAI CONF ARTIF INTE, P6437