LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer

Citations: 0
Authors
Cao, Yuxin [1 ]
Zhao, Ziyu [2 ]
Xiao, Xi [1 ]
Wang, Derui [3 ]
Xue, Minhui [3 ]
Lu, Jin [4 ]
Affiliations
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] Beijing Univ Technol, Fan Gongxiu Honors Coll, Beijing, Peoples R China
[3] CSIROs Data61, Eveleigh, NSW, Australia
[4] Ping Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
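Source
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2 | 2024
Funding
National Natural Science Foundation of China;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract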
Video recognition systems are vulnerable to adversarial examples. Recent studies show that style transfer-based and patch-based unrestricted perturbations can effectively improve attack efficiency. These attacks, however, face two main challenges: 1) Adding large stylized perturbations to all pixels reduces the naturalness of the video, and such perturbations can be easily detected. 2) Patch-based video attacks do not extend to targeted attacks because of the limited search space of reinforcement learning, which has been widely used in recent video attacks. In this paper, we focus on the video black-box setting and propose a novel attack framework named LogoStyleFool, which adds a stylized logo to the clean video. We separate the attack into three stages: style reference selection, reinforcement-learning-based logo style transfer, and perturbation optimization. We solve the first challenge by scaling down the perturbation range to a regional logo, while the second challenge is addressed by adding an optimization stage after reinforcement learning. Experimental results substantiate the overall superiority of LogoStyleFool over three state-of-the-art patch-based attacks in terms of attack performance and semantic preservation. Meanwhile, LogoStyleFool maintains its performance against two existing patch-based defense methods. We believe that our research will help draw the security community's attention to such subregional style transfer attacks.
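The idea of restricting the perturbation to a regional logo can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `overlay_logo` and `changed_fraction`, the array layout (frames, height, width, channels), and the fixed logo position are all assumptions made for illustration. The style transfer, reinforcement-learning search, and optimization stages are not shown.

```python
import numpy as np


def overlay_logo(video, logo, top, left):
    """Return a copy of `video` with `logo` pasted at (top, left) in every frame.

    Assumed layout (hypothetical, not from the paper):
    video has shape (T, H, W, C) and logo has shape (h, w, C).
    """
    _, height, width, channels = video.shape
    h, w, c = logo.shape
    if c != channels:
        raise ValueError("logo and video must have the same number of channels")
    if top < 0 or left < 0 or top + h > height or left + w > width:
        raise ValueError("logo does not fit inside the frame at this position")
    out = video.copy()
    # The logo is broadcast over the frame axis, so only this region changes.
    out[:, top:top + h, left:left + w, :] = logo
    return out


def changed_fraction(clean, perturbed):
    """Fraction of pixel positions that differ between two videos."""
    differs = np.any(clean != perturbed, axis=-1)
    return float(differs.mean())


# Toy data: an all-black clip and an all-white logo.
video = np.zeros((8, 112, 112, 3), dtype=np.float32)
logo = np.ones((16, 24, 3), dtype=np.float32)
adv = overlay_logo(video, logo, top=4, left=4)
```

In this toy setting only the 16 x 24 logo region of each 112 x 112 frame is modified, which is the sense in which the perturbation range is scaled down compared with a perturbation applied to all pixels.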
Pages: 945 - 953
Number of pages: 9