Multi-layer Feature Augmentation Based Transferable Adversarial Examples Generation for Speaker Recognition

被引:0
作者
Li, Zhuhai [1 ]
Zhang, Jie [1 ]
Guo, Wu [1 ]
机构
[1] Univ Sci & Technol China, NERC SLIP, Hefei 230027, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024 | 2024年 / 14865卷
关键词
Adversarial Attack; Transferability; Speaker Recognition;
D O I
10.1007/978-981-97-5591-2_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adversarial examples that almost remain imperceptible for human can mislead practical speaker recognition systems. However, most existing adversaries generated by substitute models have a poor transferability to attack the unseen victim models. To tackle this problem, in this work we propose a multilayer feature augmentation method to improve the transferability of adversarial examples. Specifically, we apply data augmentation on the intermediate-layer feature maps of the substitute model to create diverse pseudo victim models. By attacking the ensemble of the substitute model and the corresponding augmented models, the proposed method can help the adversarial examples avoid overfitting, resulting in more transferable adversarial examples. Experimental results on the VoxCeleb dataset verify the effectiveness of the proposed approach for the speaker identification and speaker verification tasks.
引用
收藏
页码:373 / 385
页数:13
相关论文
共 50 条
  • [41] PART-BASED FEATURE SQUEEZING TO DETECT ADVERSARIAL EXAMPLES IN PERSON RE-IDENTIFICATION NETWORKS
    Zheng, Yu
    Velipasalar, Senem
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 844 - 848
  • [42] MULTI-VIEW SELF-ATTENTION BASED TRANSFORMER FOR SPEAKER RECOGNITION
    Wang, Rui
    Ao, Junyi
    Zhou, Long
    Liu, Shujie
    Wei, Zhihua
    Ko, Tom
    Li, Qing
    Zhang, Yu
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6732 - 6736
  • [43] Multi-task learning for X-vector based speaker recognition
    Zhang Y.
    Liu L.
    International Journal of Speech Technology, 2023, 26 (04) : 817 - 823
  • [44] Speaker recognition based on characteristic spectrograms and an improved self-organizing feature map neural network
    Jia, Yanjie
    Chen, Xi
    Yu, Jieqiong
    Wang, Lianming
    Xu, Yuanzhe
    Liu, Shaojin
    Wang, Yonghui
    COMPLEX & INTELLIGENT SYSTEMS, 2021, 7 (04) : 1749 - 1757
  • [45] The Wavelet and Fourier Transforms in Feature Extraction for Text-Dependent, Filterbank-Based Speaker Recognition
    Turner, Claude
    Joseph, Anthony
    Aksu, Murat
    Langdon, Heather
    COMPLEX ADAPTIVE SYSTEMS, 2011, 6
  • [46] Improving Time Delay Neural Network Based Speaker Recognition With Convolutional Block And Feature Aggregation Methods
    Zhang, Yu-Jia
    Wang, Yih-Wen
    Chen, Chia-Ping
    Lu, Chung-Li
    Chan, Bo-Cheng
    INTERSPEECH 2021, 2021, : 76 - 80
  • [47] Speaker recognition based on characteristic spectrograms and an improved self-organizing feature map neural network
    Yanjie Jia
    Xi Chen
    Jieqiong Yu
    Lianming Wang
    Yuanzhe Xu
    Shaojin Liu
    Yonghui Wang
    Complex & Intelligent Systems, 2021, 7 : 1749 - 1757
  • [48] Text-independent Speaker Recognition Based on One Third Octave Feature and Grey Relational Analysis
    Zhu Jianmin
    Zhang Lei
    Zhai Dongting
    Huang Zhiwen
    Wang Jun
    JOURNAL OF GREY SYSTEM, 2012, 24 (04) : 347 - 358
  • [49] Short Utterance Speaker Recognition Based on Speech High Frequency Information Compensation and Dynamic Feature Enhancement Methods
    Zi, Yunfei
    Xiong, Shengwu
    ARCHIVES OF ACOUSTICS, 2024, 49 (01) : 37 - 48
  • [50] Feature Extraction Using HHT-based Locally Optimized Short-Time Fractional Fourier Transform for Speaker Recognition
    Wang, Jinfang
    Du, Hailong
    Guo, Ming
    Nie, Xinli
    Luan, Shuxin
    Liu, Chang
    2017 IEEE INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2017,