A Speaker Recognition Method Based on Dynamic Convolution with Dual Attention Mechanism

Cited by: 0

Authors:

Luo, Yuan [1]

Zhu, Kuilin [1]

Wang, Wenhao [1]

Lin, Ziyao [1]

Affiliations:

[1] Chongqing University of Posts and Telecommunications, School of Optoelectronic Engineering, Chongqing 400065, People's Republic of China

Source:

Engineering Letters | 2023, Vol. 31, No. 02

Keywords:

Speaker Recognition; Deep Learning; Attention Mechanism; Dynamic Convolution

DOI:

Not available

Chinese Library Classification (CLC) number:

T [Industrial Technology]

Discipline classification code:

08

Abstract:

Deep neural networks have gained significant attention in text-independent speaker recognition tasks. However, due to the fixed parameters of traditional static convolutional neural networks, they cannot flexibly capture the variation in phonemes that are integral to speech sentences. To address this limitation, this paper proposes a channel-space attention-based dynamic convolutional speaker recognition method. This method employs dual-attention mechanisms to generate dynamic convolutional kernels, which improves the capture of phoneme variation information between different inputs in the speech signal. We conducted experiments using the TIMIT dataset to evaluate the proposed method's effectiveness in various network frameworks. Our results show that the best performance can be achieved when dynamic convolution is generated using four static convolutional kernels. Specifically, in the ResNet-34 framework, the Equal Error Rate (EER%) of the proposed method is improved by 31.1% over the static convolutional method CNN and by 20.3% over the single-attention dynamic convolutional method (DynamicConv). Additionally, the performance of the proposed method is enhanced in all other network frameworks. These findings demonstrate the effectiveness of the proposed method and the importance of considering phoneme variations in speaker recognition systems.
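The abstract does not give layer-level details, so the following is only a minimal NumPy sketch of the general idea it describes: several static kernels (four here, the count the abstract reports as best) are mixed into one input-dependent kernel using weights derived from both a channel descriptor and a spatial (time-axis) descriptor. The function names, the projection matrices `w_channel` and `w_spatial`, the use of simple average pooling, and the fixed input length are all assumptions made for illustration, not the authors' architecture.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over a 1-D array."""
    shifted = np.exp(logits - np.max(logits))
    return shifted / shifted.sum()


def dual_attention_weights(x, w_channel, w_spatial):
    """Return K mixing weights for a feature map x of shape (C_in, T).

    Hypothetical design: one descriptor is pooled over time (per channel),
    the other over channels (per time step); both are projected to K logits.
    """
    channel_desc = x.mean(axis=1)  # (C_in,)
    spatial_desc = x.mean(axis=0)  # (T,)
    logits = w_channel @ channel_desc + w_spatial @ spatial_desc  # (K,)
    return softmax(logits)


def dynamic_conv1d(x, kernels, w_channel, w_spatial):
    """Apply an input-dependent 1-D convolution ('valid' padding, stride 1).

    kernels has shape (K, C_out, C_in, width): K static kernels that are
    aggregated into a single kernel for this particular input.
    """
    pi = dual_attention_weights(x, w_channel, w_spatial)
    kernel = np.tensordot(pi, kernels, axes=1)  # (C_out, C_in, width)
    c_out, _, width = kernel.shape
    t_out = x.shape[1] - width + 1
    y = np.zeros((c_out, t_out))
    for t in range(t_out):
        window = x[:, t:t + width]  # (C_in, width)
        y[:, t] = np.tensordot(kernel, window, axes=([1, 2], [0, 1]))
    return y, pi


# Toy input: 8 feature channels, 50 frames, K = 4 static kernels.
rng = np.random.default_rng(0)
K, C_IN, C_OUT, T, WIDTH = 4, 8, 16, 50, 3
x = rng.standard_normal((C_IN, T))
kernels = rng.standard_normal((K, C_OUT, C_IN, WIDTH))
w_channel = rng.standard_normal((K, C_IN))
w_spatial = rng.standard_normal((K, T))

y, pi = dynamic_conv1d(x, kernels, w_channel, w_spatial)
print("output shape:", y.shape)
print("mixing weights:", pi)
```

Because the mixing weights depend on the input, two utterances with different content are filtered by different effective kernels, which is the property the abstract contrasts with a static convolution whose parameters are fixed after training.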


Pages: 825-832

Number of pages: 1


