Unconstrained vocal pattern recognition algorithm based on attention mechanism

被引：1

作者：

Li, Yaqian ^{[1
]}

Zhang, Xiaolong ^{[2
]}

Zhang, Xuyao ^{[3
]}

Li, Haibin ^{[1
]}

Zhang, Wenming ^{[4
]}

机构：

[1] Yanshan Univ, Pattern Recognized, Elect Engn, Qinhuangdao, Hebei, Peoples R China

[2] Yanshan Univ, Speaker Diarizat, Elect Engn, Qinhuangdao, Hebei, Peoples R China

[3] Yanshan Univ, Speaker Verificat, Elect Engn, Qinhuangdao, Hebei, Peoples R China

[4] Yanshan Univ, Camera Calibrat, Elect Engn, Qinhuangdao, Hebei, Peoples R China

来源：

DIGITAL SIGNAL PROCESSING | 2023年 / 136卷

基金：

中国国家自然科学基金;

关键词：

Voiceprint recognition; Unconstrained datasets; Attention mechanism; Feature fusion;

D O I：

10.1016/j.dsp.2023.103973

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Deep learning-based voiceprint recognition methods rely heavily on adequate datasets, especially those closer to the natural environment and more complex under unconstrained conditions. Yet, the data types of open-source speech datasets are too homogeneous nowadays, and there are some differences with the address collected in natural application environments. For few Chinese datasets used, this paper proposes and produces an unconstrained Chinese speech dataset with richer data types closer to those collected in a natural environment. To address the inadequate extraction of acoustic features in the unconstrained speech dataset, a new two-dimensional convolutional residual network structure based on the attention mechanism is designed and applied to acoustic feature extraction. The residual block structure in the residual network is improved by the SE module and the CBAM module to obtain the SE-Cov2d and CSA-Cov2d models respectively. Finally, it is experimentally demonstrated that the attention mechanism can help the network focus on more critical feature information and fuse more differentiated features in feature extraction. (c) 2023 Elsevier Inc. All rights reserved.

引用

页数：8

共 50 条

[1] Shoe Type Recognition Algorithm Based on Attention Mechanism
Zhang Jiajun
Tang Yunqi
Yang Zhixiong
Geng Pengzhi
LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (02)
[2] A Pedestrian Detection Algorithm Based on Channel Attention Mechanism
Li, Weidong
Han, Shuang
Liu, Yang
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 5954 - 5959
[3] Synthetic Aperture Radar SAR Image Target Recognition Algorithm Based on Attention Mechanism
Shi, Baodai
Zhang, Qin
Wang, Dayan
Li, Yao
IEEE ACCESS, 2021, 9 : 140512 - 140524
[4] Attention Mechanism Based Joint Optimization Algorithm for Defect Detection
Dong Y.
Sun S.
Wang Z.
Liu J.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (01): : 102 - 111
[5] Multi-view and multi-scale behavior recognition algorithm based on attention mechanism
Zhang, Di
Chen, Chen
Tan, Fa
Qian, Beibei
Li, Wei
He, Xuan
Lei, Susan
FRONTIERS IN NEUROROBOTICS, 2023, 17
[6] Link prediction algorithm based on attention mechanism
Cheng H.
Zhang L.
Fang Y.
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2019, 47 (02): : 109 - 114
[7] Pattern recognition of surface electromyography based on multi-scale convolutional neural network with attention mechanism
Wang B.
Zheng H.
Jie J.
Zhang M.
Ke Y.
Liu Y.
International Journal of Wireless and Mobile Computing, 2022, 23 (3-4) : 293 - 301
[8] Multi-Scale Target Detection Algorithm Based on Attention Mechanism
Ju Moran
Luo Jiangning
Wang Zhongbo
Luo Haibo
ACTA OPTICA SINICA, 2020, 40 (13)
[9] A Novel Document Classification Algorithm Based on Statistical Features and Attention Mechanism
Li, Chao
Cheng, Yanfen
Wang, Hongxia
2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
[10] An underwater target recognition algorithm incorporating improved attention mechanism and downsampling
Zhu, QiGuang
Cen, Qiang
Wang, YuXin
Chen, WeiDong
Liu, Shuo
VISUAL COMPUTER, 2025, 41 (03): : 1499 - 1509

← 1 2 3 4 5 →