LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer

Cited by: 0

Authors:

Cao, Yuxin [1]

Zhao, Ziyu [2]

Xiao, Xi [1]

Wang, Derui [3]

Xue, Minhui [3]

Lu, Jin [4]

Affiliations:

[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China

[2] Beijing Univ Technol, Fan Gongxiu Honors Coll, Beijing, Peoples R China

[3] CSIRO's Data61, Eveleigh, NSW, Australia

[4] Ping Technol Shenzhen Co Ltd, Shenzhen, Peoples R China

Source:

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2 | 2024

Funding:

National Natural Science Foundation of China

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Video recognition systems are vulnerable to adversarial examples. Recent studies show that style transfer-based and patch-based unrestricted perturbations can effectively improve attack efficiency. These attacks, however, face two main challenges: 1) Adding large stylized perturbations to all pixels reduces the naturalness of the video, and such perturbations can be easily detected. 2) Patch-based video attacks are not extensible to targeted attacks due to the limited search space of reinforcement learning, which has been widely used in video attacks recently. In this paper, we focus on the video black-box setting and propose a novel attack framework named LogoStyleFool, which adds a stylized logo to the clean video. We separate the attack into three stages: style reference selection, reinforcement-learning-based logo style transfer, and perturbation optimization. We solve the first challenge by scaling down the perturbation range to a regional logo, while the second challenge is addressed by adding a complementary optimization stage after reinforcement learning. Experimental results substantiate the overall superiority of LogoStyleFool over three state-of-the-art patch-based attacks in terms of attack performance and semantic preservation. Meanwhile, LogoStyleFool still maintains its performance against two existing patch-based defense methods. We believe that our research is beneficial in increasing the attention of the security community to such subregional style transfer attacks.

Pages: 945-953

Page count: 9
