Cross-modal and Cross-medium Adversarial Attack for Audio

Cited by: 0
Authors
Zhang, Liguo [1]
Tian, Zilin [1]
Long, Yunfei [1]
Li, Sizhao [1]
Yin, Guisheng [1]
Affiliations
[1] Harbin Engineering University, Harbin, Heilongjiang, People's Republic of China
Source
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023
Keywords
Audio signal; Cross-modal; Cross-medium; Adversarial Attack; NETWORK;
DOI
10.1145/3581783.3612475
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Acoustic waves are forms of energy that propagate through various media. They can be represented by different modalities, such as auditory signals and visual patterns. The two modalities are often described as a one-dimensional waveform in the time domain and a two-dimensional spectrogram in the frequency domain. Most acoustic signal processing methods use single-modal data as model input and for training. This poses a challenge for black-box adversarial attacks on audio signals because the input modality is also unknown to the attacker. In fact, there are currently no methods that explore the cross-modal transferability of adversarial perturbations. This paper investigates the cross-modal transferability from waveform to spectrogram. We argue that the data distributions in the sample spaces of the different modalities have mapping relations and propose a novel decision-based cross-modal and cross-medium adversarial attack method. Specifically, it generates an initial example with cross-modal attack capability by combining random natural noise, then iteratively reduces the perturbation to enhance its invisibility. It incorporates the constraints of the spectrogram sample space while iteratively optimizing adversarial perturbations for black-box audio classification models. The perturbation is imperceptible to humans, both visually and aurally. Extensive experiments demonstrate that our approach can launch attacks on classification models for sound waves and spectrograms that share the same audio signal. Furthermore, we explore the cross-medium capability of our proposed adversarial attack strategy, which can target processing models for acoustic signals propagating in air and seawater. The proposed method achieves better invisibility and generalization than the other methods compared.
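The abstract outlines a decision-based loop: start from a noisy example that already fools the model, then shrink the perturbation while the hard label stays wrong. The following is a minimal generic sketch of that idea, not the authors' algorithm. The names `spectrogram`, `toy_black_box`, and `decision_based_attack` are hypothetical, the classifier is a toy stand-in that exposes only a label, a single noise signal replaces the paper's combination of natural noises, and the spectrogram sample-space constraints are omitted.

```python
import numpy as np

def spectrogram(wave, n_fft=256, hop=128):
    """Magnitude spectrogram from a Hann-windowed short-time FFT."""
    window = np.hanning(n_fft)
    frames = [wave[i:i + n_fft] * window
              for i in range(0, len(wave) - n_fft + 1, hop)]
    # Rows are frequency bins, columns are time frames.
    return np.abs(np.fft.rfft(np.stack(frames), axis=1)).T

def toy_black_box(wave):
    """Hypothetical classifier: the attacker sees only the hard label."""
    spec = spectrogram(wave)
    half = spec.shape[0] // 2
    # Label 1 when the lower half of the frequency bins holds more magnitude.
    return int(spec[:half].sum() > spec[half:].sum())

def decision_based_attack(wave, noise, predict, steps=30):
    """Binary-search the smallest noise fraction that still flips the label."""
    label = predict(wave)
    if predict(noise) == label:
        raise ValueError("the starting noise must already be misclassified")
    lo, hi = 0.0, 1.0  # hi is always a noise fraction known to be adversarial
    for _ in range(steps):
        mid = (lo + hi) / 2
        candidate = (1 - mid) * wave + mid * noise
        if predict(candidate) != label:
            hi = mid  # still adversarial, so try a smaller perturbation
        else:
            lo = mid  # label restored, so more noise is needed
    return (1 - hi) * wave + hi * noise

# Usage: a low tone as the clean signal, a high tone as the starting noise.
sr = 8000
t = np.arange(4000) / sr
clean = np.sin(2 * np.pi * 200 * t)
start = np.sin(2 * np.pi * 3000 * t)
adversarial = decision_based_attack(clean, start, toy_black_box)
```

The search queries only the predicted label, which is what makes the setting decision-based; score-based or gradient-based attacks would need more access to the model.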
Pages: 444-453
Page count: 10
Related papers
50 in total
  • [1] Adversarial Attack on Deep Cross-Modal Hamming Retrieval
    Li, Chao
    Gao, Shangqian
    Deng, Cheng
    Liu, Wei
    Huang, Heng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2198 - 2207
  • [2] Adversarial Cross-Modal Retrieval
    Wang, Bokun
    Yang, Yang
    Xu, Xing
    Hanjalic, Alan
    Shen, Heng Tao
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 154 - 162
  • [3] Cross-modal Adversarial Reprogramming
    Neekhara, Paarth
    Hussain, Shehzeen
    Du, Jinglong
    Dubnov, Shlomo
    Koushanfar, Farinaz
    McAuley, Julian
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2898 - 2906
  • [4] Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval
    Wang, Tianshi
    Zhu, Lei
    Zhang, Zheng
    Zhang, Huaxiang
    Han, Junwei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 6159 - 6172
  • [5] Cross-Modal Learning with Adversarial Samples
    Li, Chao
    Deng, Cheng
    Gao, Shangqian
    Xie, De
    Liu, Wei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] Cross-modal discriminant adversarial network
    Hu, Peng
    Peng, Xi
    Zhu, Hongyuan
    Lin, Jie
    Zhen, Liangli
    Wang, Wei
    Peng, Dezhong
    PATTERN RECOGNITION, 2021, 112
  • [7] Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching
    Zheng, Aihua
    Hu, Menglan
    Jiang, Bo
    Huang, Yan
    Yan, Yan
    Luo, Bin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 338 - 351
  • [8] Unsupervised Generative Adversarial Cross-Modal Hashing
    Zhang, Jian
    Peng, Yuxin
    Yuan, Mingkuan
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 539 - 546
  • [9] Multimodal adversarial network for cross-modal retrieval
    Hu, Peng
    Peng, Dezhong
    Wang, Xu
    Xiang, Yong
    KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 38 - 50
  • [10] Dual discriminant adversarial cross-modal retrieval
    Pei He
    Meng Wang
    Ding Tu
    Zhuo Wang
    Applied Intelligence, 2023, 53 : 4257 - 4267