Multi-layer Feature Augmentation Based Transferable Adversarial Examples Generation for Speaker Recognition

被引：0

作者：

Li, Zhuhai ^{[1
]}

Zhang, Jie ^{[1
]}

Guo, Wu ^{[1
]}

机构：

[1] Univ Sci & Technol China, NERC SLIP, Hefei 230027, Peoples R China

来源：

ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024 | 2024年 / 14865卷

关键词：

Adversarial Attack; Transferability; Speaker Recognition;

D O I：

10.1007/978-981-97-5591-2_32

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Adversarial examples that almost remain imperceptible for human can mislead practical speaker recognition systems. However, most existing adversaries generated by substitute models have a poor transferability to attack the unseen victim models. To tackle this problem, in this work we propose a multilayer feature augmentation method to improve the transferability of adversarial examples. Specifically, we apply data augmentation on the intermediate-layer feature maps of the substitute model to create diverse pseudo victim models. By attacking the ensemble of the substitute model and the corresponding augmented models, the proposed method can help the adversarial examples avoid overfitting, resulting in more transferable adversarial examples. Experimental results on the VoxCeleb dataset verify the effectiveness of the proposed approach for the speaker identification and speaker verification tasks.

引用

页码：373 / 385

页数：13

共 50 条

[41] PART-BASED FEATURE SQUEEZING TO DETECT ADVERSARIAL EXAMPLES IN PERSON RE-IDENTIFICATION NETWORKS
Zheng, Yu
Velipasalar, Senem
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 844 - 848
[42] MULTI-VIEW SELF-ATTENTION BASED TRANSFORMER FOR SPEAKER RECOGNITION
Wang, Rui
Ao, Junyi
Zhou, Long
Liu, Shujie
Wei, Zhihua
Ko, Tom
Li, Qing
Zhang, Yu
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6732 - 6736
[43] Multi-task learning for X-vector based speaker recognition
Zhang Y.
Liu L.
International Journal of Speech Technology, 2023, 26 (04) : 817 - 823
[44] Speaker recognition based on characteristic spectrograms and an improved self-organizing feature map neural network
Jia, Yanjie
Chen, Xi
Yu, Jieqiong
Wang, Lianming
Xu, Yuanzhe
Liu, Shaojin
Wang, Yonghui
COMPLEX & INTELLIGENT SYSTEMS, 2021, 7 (04) : 1749 - 1757
[45] The Wavelet and Fourier Transforms in Feature Extraction for Text-Dependent, Filterbank-Based Speaker Recognition
Turner, Claude
Joseph, Anthony
Aksu, Murat
Langdon, Heather
COMPLEX ADAPTIVE SYSTEMS, 2011, 6
[46] Improving Time Delay Neural Network Based Speaker Recognition With Convolutional Block And Feature Aggregation Methods
Zhang, Yu-Jia
Wang, Yih-Wen
Chen, Chia-Ping
Lu, Chung-Li
Chan, Bo-Cheng
INTERSPEECH 2021, 2021, : 76 - 80
[47] Speaker recognition based on characteristic spectrograms and an improved self-organizing feature map neural network
Yanjie Jia
Xi Chen
Jieqiong Yu
Lianming Wang
Yuanzhe Xu
Shaojin Liu
Yonghui Wang
Complex & Intelligent Systems, 2021, 7 : 1749 - 1757
[48] Text-independent Speaker Recognition Based on One Third Octave Feature and Grey Relational Analysis
Zhu Jianmin
Zhang Lei
Zhai Dongting
Huang Zhiwen
Wang Jun
JOURNAL OF GREY SYSTEM, 2012, 24 (04) : 347 - 358
[49] Short Utterance Speaker Recognition Based on Speech High Frequency Information Compensation and Dynamic Feature Enhancement Methods
Zi, Yunfei
Xiong, Shengwu
ARCHIVES OF ACOUSTICS, 2024, 49 (01) : 37 - 48
[50] Feature Extraction Using HHT-based Locally Optimized Short-Time Fractional Fourier Transform for Speaker Recognition
Wang, Jinfang
Du, Hailong
Guo, Ming
Nie, Xinli
Luan, Shuxin
Liu, Chang
2017 IEEE INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2017,

← 1 2 3 4 5 →