Unconstrained vocal pattern recognition algorithm based on attention mechanism

被引:1
|
作者
Li, Yaqian [1 ]
Zhang, Xiaolong [2 ]
Zhang, Xuyao [3 ]
Li, Haibin [1 ]
Zhang, Wenming [4 ]
机构
[1] Yanshan Univ, Pattern Recognized, Elect Engn, Qinhuangdao, Hebei, Peoples R China
[2] Yanshan Univ, Speaker Diarizat, Elect Engn, Qinhuangdao, Hebei, Peoples R China
[3] Yanshan Univ, Speaker Verificat, Elect Engn, Qinhuangdao, Hebei, Peoples R China
[4] Yanshan Univ, Camera Calibrat, Elect Engn, Qinhuangdao, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Voiceprint recognition; Unconstrained datasets; Attention mechanism; Feature fusion;
D O I
10.1016/j.dsp.2023.103973
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning-based voiceprint recognition methods rely heavily on adequate datasets, especially those closer to the natural environment and more complex under unconstrained conditions. Yet, the data types of open-source speech datasets are too homogeneous nowadays, and there are some differences with the address collected in natural application environments. For few Chinese datasets used, this paper proposes and produces an unconstrained Chinese speech dataset with richer data types closer to those collected in a natural environment. To address the inadequate extraction of acoustic features in the unconstrained speech dataset, a new two-dimensional convolutional residual network structure based on the attention mechanism is designed and applied to acoustic feature extraction. The residual block structure in the residual network is improved by the SE module and the CBAM module to obtain the SE-Cov2d and CSA-Cov2d models respectively. Finally, it is experimentally demonstrated that the attention mechanism can help the network focus on more critical feature information and fuse more differentiated features in feature extraction. (c) 2023 Elsevier Inc. All rights reserved.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Gait Recognition in Different Terrains with IMUs Based on Attention Mechanism Feature Fusion Method
    Mengxue Yan
    Ming Guo
    Jianqiang Sun
    Jianlong Qiu
    Xiangyong Chen
    Neural Processing Letters, 2023, 55 : 10215 - 10234
  • [32] Pin-missing defect recognition based on feature fusion and spatial attention mechanism
    He, Hui
    Li, Yuchen
    Yang, Jing
    Wang, Zeli
    Chen, Bo
    Jiao, Runhai
    ENERGY REPORTS, 2022, 8 : 656 - 663
  • [33] Gait Recognition in Different Terrains with IMUs Based on Attention Mechanism Feature Fusion Method
    Yan, Mengxue
    Guo, Ming
    Sun, Jianqiang
    Qiu, Jianlong
    Chen, Xiangyong
    NEURAL PROCESSING LETTERS, 2023, 55 (08) : 10215 - 10234
  • [34] Tobacco Retail License Recognition Based on Dual Attention Mechanism
    Shan, Yuxiang
    Ren, Qin
    Wang, Cheng
    Wang, Xiuhui
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2022, 18 (04): : 480 - 488
  • [35] Facial expression recognition based on facial part attention mechanism
    Zhong, Qiubo
    Fang, Baofu
    Wei, Shenbin
    Wang, Zaijun
    Zhang, Haoxiang
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
  • [36] Maize Leaf Compound Disease Recognition Based on Attention Mechanism
    Dong, Ping
    Li, Kuo
    Wang, Ming
    Li, Feitao
    Guo, Wei
    Si, Haiping
    AGRICULTURE-BASEL, 2024, 14 (01):
  • [37] Food Recognition Model Based on Deep Learning and Attention Mechanism
    He, Lili
    Cai, Zhiwei
    Ouyang, Dantong
    Bai, Hongtao
    2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS, BIGCOM, 2022, : 331 - 341
  • [38] Multiple premises entailment recognition based on attention and gate mechanism
    Wu, Pin
    Lei, Zhidan
    Zhou, Quan
    Zhu, Rukang
    Chang, Xuting
    Sun, Junwu
    Zhang, Wenjie
    Guo, Yike
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 147 (147)
  • [39] Facial Expression Recognition Based on Multiscale Features and Attention Mechanism
    Yao, Lisha
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2024, 58 (04) : 429 - 440
  • [40] Food Recognition Model Based on Deep Learning and Attention Mechanism
    He, Lili
    Cai, Zhiwei
    Ouyang, Dantong
    Bai, Hongtao
    2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS, BIGCOM, 2022, : 206 - 216