LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer

被引:0
|
作者
Cao, Yuxin [1 ]
Zhao, Ziyu [2 ]
Xiao, Xi [1 ]
Wang, Derui [3 ]
Xue, Minhui [3 ]
Lu, Jin [4 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] Beijing Univ Technol, Fan Gongxiu Honors Coll, Beijing, Peoples R China
[3] CSIROs Data61, Eveleigh, NSW, Australia
[4] Ping Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2 | 2024年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video recognition systems are vulnerable to adversarial examples. Recent studies show that style transfer-based and patch-based unrestricted perturbations can effectively improve attack efficiency. These attacks, however, face two main challenges: 1) Adding large stylized perturbations to all pixels reduces the naturalness of the video and such perturbations can be easily detected. 2) Patch-based video attacks are not extensible to targeted attacks due to the limited search space of reinforcement learning that has been widely used in video attacks recently. In this paper, we focus on the video blackbox setting and propose a novel attack framework named LogoStyleFool by adding a stylized logo to the clean video. We separate the attack into three stages: style reference selection, reinforcement-learning-based logo style transfer, and perturbation optimization. We solve the first challenge by scaling down the perturbation range to a regional logo, while the second challenge is addressed by complementing an optimization stage after reinforcement learning. Experimental results substantiate the overall superiority of LogoStyleFool over three state-of-the-art patch-based attacks in terms of attack performance and semantic preservation. Meanwhile, LogoStyleFool still maintains its performance against two existing patch-based defense methods. We believe that our research is beneficial in increasing the attention of the security community to such subregional style transfer attacks.
引用
收藏
页码:945 / 953
页数:9
相关论文
共 50 条
  • [21] Logo Recognition via Fusion of Spatial and Spectral Features
    Shakir, Sahar
    Gacav, Caner
    Topal, Cihan
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [22] A Novel Location and Matching Algorithm for Rapid Logo Recognition in Video Advertisements
    Zhang, Yuan
    Zhang, Shuwu
    Liang, Wei
    Liang, Jinchun
    PROCEEDINGS OF 2012 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2012, : 43 - 47
  • [23] Towards efficient image and video style transfer via distillation and learnable feature transformation
    Huo, Jing
    Kong, Meihao
    Li, Wenbin
    Wu, Jing
    Lai, Yu-Kun
    Gao, Yang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 241
  • [24] Consistent Panoramic Video Style Transfer via Temporal-Spatial Cross Perception
    Wang, Weiyu
    Qing, Chunmei
    Tan, Junpeng
    Xu, Xiangmin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 265 - 277
  • [25] Style-A-Video: Agile Diffusion for Arbitrary Text-Based Video Style Transfer
    Huang, Nisha
    Zhang, Yuxin
    Dong, Weiming
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1494 - 1498
  • [26] Federated Domain Generalization for Image Recognition via Cross-Client Style Transfer
    Chen, Junming
    Jiang, Meirui
    Dou, Qi
    Chen, Qifeng
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 361 - 370
  • [27] Fast Video Multi-Style Transfer
    Gao, Wei
    Lie, Yijun
    Yin, Yihang
    Yang, Ming-Hsuan
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 3211 - 3219
  • [28] HeterStyle: A Heterogeneous Video Style Transfer Application
    Liu, Xingyu
    Guo, Jingfan
    Ren, Tongwei
    Han, Yahong
    Huang, Lei
    Wu, Gangshan
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1272 - 1273
  • [29] Image and Video Style Transfer Based on Transformer
    Fengxue, Sun
    Yanguo, Sun
    Zhenping, Lan
    Yanqi, Wang
    Nianchao, Zhang
    Yuru, Wang
    Ping, Li
    IEEE ACCESS, 2023, 11 : 56400 - 56407
  • [30] Style of Action based Individual Recognition in Video Sequences
    Pratheepan, Y.
    Prasad, G.
    Condell, J. V.
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1236 - 1241