Unconstrained vocal pattern recognition algorithm based on attention mechanism

被引:1
|
作者
Li, Yaqian [1 ]
Zhang, Xiaolong [2 ]
Zhang, Xuyao [3 ]
Li, Haibin [1 ]
Zhang, Wenming [4 ]
机构
[1] Yanshan Univ, Pattern Recognized, Elect Engn, Qinhuangdao, Hebei, Peoples R China
[2] Yanshan Univ, Speaker Diarizat, Elect Engn, Qinhuangdao, Hebei, Peoples R China
[3] Yanshan Univ, Speaker Verificat, Elect Engn, Qinhuangdao, Hebei, Peoples R China
[4] Yanshan Univ, Camera Calibrat, Elect Engn, Qinhuangdao, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Voiceprint recognition; Unconstrained datasets; Attention mechanism; Feature fusion;
D O I
10.1016/j.dsp.2023.103973
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning-based voiceprint recognition methods rely heavily on adequate datasets, especially those closer to the natural environment and more complex under unconstrained conditions. Yet, the data types of open-source speech datasets are too homogeneous nowadays, and there are some differences with the address collected in natural application environments. For few Chinese datasets used, this paper proposes and produces an unconstrained Chinese speech dataset with richer data types closer to those collected in a natural environment. To address the inadequate extraction of acoustic features in the unconstrained speech dataset, a new two-dimensional convolutional residual network structure based on the attention mechanism is designed and applied to acoustic feature extraction. The residual block structure in the residual network is improved by the SE module and the CBAM module to obtain the SE-Cov2d and CSA-Cov2d models respectively. Finally, it is experimentally demonstrated that the attention mechanism can help the network focus on more critical feature information and fuse more differentiated features in feature extraction. (c) 2023 Elsevier Inc. All rights reserved.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Shoe Type Recognition Algorithm Based on Attention Mechanism
    Zhang Jiajun
    Tang Yunqi
    Yang Zhixiong
    Geng Pengzhi
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (02)
  • [2] A Pedestrian Detection Algorithm Based on Channel Attention Mechanism
    Li, Weidong
    Han, Shuang
    Liu, Yang
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 5954 - 5959
  • [3] Synthetic Aperture Radar SAR Image Target Recognition Algorithm Based on Attention Mechanism
    Shi, Baodai
    Zhang, Qin
    Wang, Dayan
    Li, Yao
    IEEE ACCESS, 2021, 9 : 140512 - 140524
  • [4] Attention Mechanism Based Joint Optimization Algorithm for Defect Detection
    Dong Y.
    Sun S.
    Wang Z.
    Liu J.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (01): : 102 - 111
  • [5] Multi-view and multi-scale behavior recognition algorithm based on attention mechanism
    Zhang, Di
    Chen, Chen
    Tan, Fa
    Qian, Beibei
    Li, Wei
    He, Xuan
    Lei, Susan
    FRONTIERS IN NEUROROBOTICS, 2023, 17
  • [6] Link prediction algorithm based on attention mechanism
    Cheng H.
    Zhang L.
    Fang Y.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2019, 47 (02): : 109 - 114
  • [7] Pattern recognition of surface electromyography based on multi-scale convolutional neural network with attention mechanism
    Wang B.
    Zheng H.
    Jie J.
    Zhang M.
    Ke Y.
    Liu Y.
    International Journal of Wireless and Mobile Computing, 2022, 23 (3-4) : 293 - 301
  • [8] Multi-Scale Target Detection Algorithm Based on Attention Mechanism
    Ju Moran
    Luo Jiangning
    Wang Zhongbo
    Luo Haibo
    ACTA OPTICA SINICA, 2020, 40 (13)
  • [9] A Novel Document Classification Algorithm Based on Statistical Features and Attention Mechanism
    Li, Chao
    Cheng, Yanfen
    Wang, Hongxia
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [10] An underwater target recognition algorithm incorporating improved attention mechanism and downsampling
    Zhu, QiGuang
    Cen, Qiang
    Wang, YuXin
    Chen, WeiDong
    Liu, Shuo
    VISUAL COMPUTER, 2025, 41 (03): : 1499 - 1509