LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer

被引：0

作者：

Cao, Yuxin ^{[1
]}

Zhao, Ziyu ^{[2
]}

Xiao, Xi ^{[1
]}

Wang, Derui ^{[3
]}

Xue, Minhui ^{[3
]}

Lu, Jin ^{[4
]}

机构：

[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China

[2] Beijing Univ Technol, Fan Gongxiu Honors Coll, Beijing, Peoples R China

[3] CSIROs Data61, Eveleigh, NSW, Australia

[4] Ping Technol Shenzhen Co Ltd, Shenzhen, Peoples R China

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2 | 2024年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video recognition systems are vulnerable to adversarial examples. Recent studies show that style transfer-based and patch-based unrestricted perturbations can effectively improve attack efficiency. These attacks, however, face two main challenges: 1) Adding large stylized perturbations to all pixels reduces the naturalness of the video and such perturbations can be easily detected. 2) Patch-based video attacks are not extensible to targeted attacks due to the limited search space of reinforcement learning that has been widely used in video attacks recently. In this paper, we focus on the video blackbox setting and propose a novel attack framework named LogoStyleFool by adding a stylized logo to the clean video. We separate the attack into three stages: style reference selection, reinforcement-learning-based logo style transfer, and perturbation optimization. We solve the first challenge by scaling down the perturbation range to a regional logo, while the second challenge is addressed by complementing an optimization stage after reinforcement learning. Experimental results substantiate the overall superiority of LogoStyleFool over three state-of-the-art patch-based attacks in terms of attack performance and semantic preservation. Meanwhile, LogoStyleFool still maintains its performance against two existing patch-based defense methods. We believe that our research is beneficial in increasing the attention of the security community to such subregional style transfer attacks.

引用

页码：945 / 953

页数：9

共 50 条

[21] Logo Recognition via Fusion of Spatial and Spectral Features
Shakir, Sahar
Gacav, Caner
Topal, Cihan
2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
[22] A Novel Location and Matching Algorithm for Rapid Logo Recognition in Video Advertisements
Zhang, Yuan
Zhang, Shuwu
Liang, Wei
Liang, Jinchun
PROCEEDINGS OF 2012 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2012, : 43 - 47
[23] Towards efficient image and video style transfer via distillation and learnable feature transformation
Huo, Jing
Kong, Meihao
Li, Wenbin
Wu, Jing
Lai, Yu-Kun
Gao, Yang
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 241
[24] Consistent Panoramic Video Style Transfer via Temporal-Spatial Cross Perception
Wang, Weiyu
Qing, Chunmei
Tan, Junpeng
Xu, Xiangmin
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 265 - 277
[25] Style-A-Video: Agile Diffusion for Arbitrary Text-Based Video Style Transfer
Huang, Nisha
Zhang, Yuxin
Dong, Weiming
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1494 - 1498
[26] Federated Domain Generalization for Image Recognition via Cross-Client Style Transfer
Chen, Junming
Jiang, Meirui
Dou, Qi
Chen, Qifeng
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 361 - 370
[27] Fast Video Multi-Style Transfer
Gao, Wei
Lie, Yijun
Yin, Yihang
Yang, Ming-Hsuan
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 3211 - 3219
[28] HeterStyle: A Heterogeneous Video Style Transfer Application
Liu, Xingyu
Guo, Jingfan
Ren, Tongwei
Han, Yahong
Huang, Lei
Wu, Gangshan
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1272 - 1273
[29] Image and Video Style Transfer Based on Transformer
Fengxue, Sun
Yanguo, Sun
Zhenping, Lan
Yanqi, Wang
Nianchao, Zhang
Yuru, Wang
Ping, Li
IEEE ACCESS, 2023, 11 : 56400 - 56407
[30] Style of Action based Individual Recognition in Video Sequences
Pratheepan, Y.
Prasad, G.
Condell, J. V.
2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1236 - 1241

← 1 2 3 4 5 →