Cross-modal and Cross-medium Adversarial Attack for Audio

被引：0

作者：

Zhang, Liguo ^{[1
]}

Tian, Zilin ^{[1
]}

Long, Yunfei ^{[1
]}

Li, Sizhao ^{[1
]}

Yin, Guisheng ^{[1
]}

机构：

[1] Harbin Engn Univ, Harbin, Heilongjiang, Peoples R China

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年

关键词：

Audio signal; Cross-modal; Cross-medium; Adversarial Attack; NETWORK;

D O I：

10.1145/3581783.3612475

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Acoustic waves are forms of energy that propagate through various mediums. They can be represented by different modalities, such as auditory signals and visual patterns. The two modalities are often described as one-dimensional waveform in the time domain and two-dimensional spectrogram in the frequency domain. Most acoustic signal processing methods use single modal data for input and training models. This poses a challenge for black-box adversarial attacks on audio signals because the input modality is also unknown to the attacker. In fact, there currently exist no methods that explore the cross-modal transferability of adversarial perturbation. This paper investigates the cross-modal transferability from waveform to spectrogram. We argue that the data distributions in the sample space with the different modalities have mapping relations and propose a novel decision-based cross-modal and cross-medium adversarial attack method. Specifically, it generates an initial example with cross-modal attack capability by combining random natural noise, then iteratively reduces the perturbation to enhance its invisibility. It incorporates the constraints of the spectrogram sample space while iteratively optimizing adversarial perturbations for black-box audio classification models. The perturbation is imperceptible to humans, both visually and aurally. Extensive experiments demonstrate that our approach can launch attacks on classification models for sound waves and spectrograms that share the same audio signal. Furthermore, we explore the cross-medium capability of our proposed adversarial attack strategy that can target processing models for acoustic signals propagating in air and seawater. The proposed method has preeminent invisibility and generalization compared to other methods.

引用

页码：444 / 453

页数：10

共 50 条

[1] Adversarial Attack on Deep Cross-Modal Hamming Retrieval
Li, Chao
Gao, Shangqian
Deng, Cheng
Liu, Wei
Huang, Heng
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2198 - 2207
[2] Adversarial Cross-Modal Retrieval
Wang, Bokun
Yang, Yang
Xu, Xing
Hanjalic, Alan
Shen, Heng Tao
PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 154 - 162
[3] Cross-modal Adversarial Reprogramming
Neekhara, Paarth
Hussain, Shehzeen
Du, Jinglong
Dubnov, Shlomo
Koushanfar, Farinaz
McAuley, Julian
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2898 - 2906
[4] Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval
Wang, Tianshi
Zhu, Lei
Zhang, Zheng
Zhang, Huaxiang
Han, Junwei
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 6159 - 6172
[5] Cross-Modal Learning with Adversarial Samples
Li, Chao
Deng, Cheng
Gao, Shangqian
Xie, De
Liu, Wei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[6] Cross-modal discriminant adversarial network
Hu, Peng
Peng, Xi
Zhu, Hongyuan
Lin, Jie
Zhen, Liangli
Wang, Wei
Peng, Dezhong
PATTERN RECOGNITION, 2021, 112
[7] Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching
Zheng, Aihua
Hu, Menglan
Jiang, Bo
Huang, Yan
Yan, Yan
Luo, Bin
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 338 - 351
[8] Unsupervised Generative Adversarial Cross-Modal Hashing
Zhang, Jian
Peng, Yuxin
Yuan, Mingkuan
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 539 - 546
[9] Multimodal adversarial network for cross-modal retrieval
Hu, Peng
Peng, Dezhong
Wang, Xu
Xiang, Yong
KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 38 - 50
[10] Dual discriminant adversarial cross-modal retrieval
Pei He
Meng Wang
Ding Tu
Zhuo Wang
Applied Intelligence, 2023, 53 : 4257 - 4267

← 1 2 3 4 5 →