Speech enhancement from fused features based on deep neural network and gated recurrent unit network

被引:0
|
作者
Youming Wang
Jiali Han
Tianqi Zhang
Didi Qing
机构
[1] Xi’an University of Posts and Telecommunications,School of Automation
[2] Xi’an Key Laboratory of Advanced Control and Intelligent Process (ACIP),undefined
来源
EURASIP Journal on Advances in Signal Processing | / 2021卷
关键词
Speech enhancement; Deep neural network; Gated recurrent unit; Speech quality;
D O I
暂无
中图分类号
学科分类号
摘要
Speech is easily interfered by external environment in reality, which results in the loss of important features. Deep learning has become a popular speech enhancement method because of its superior potential in solving nonlinear mapping problems for complex features. However, the deficiency of traditional deep learning methods is the weak learning capability of important information from previous time steps and long-term event dependencies between the time-series data. To overcome this problem, we propose a novel speech enhancement method based on the fused features of deep neural networks (DNNs) and gated recurrent unit (GRU). The proposed method uses GRU to reduce the number of parameters of DNNs and acquire the context information of the speech, which improves the enhanced speech quality and intelligibility. Firstly, DNN with multiple hidden layers is used to learn the mapping relationship between the logarithmic power spectrum (LPS) features of noisy speech and clean speech. Secondly, the LPS feature of the deep neural network is fused with the noisy speech as the input of GRU network to compensate the missing context information. Finally, GRU network is performed to learn the mapping relationship between LPS features and log power spectrum features of clean speech spectrum. The proposed model is experimentally compared with traditional speech enhancement models, including DNN, CNN, LSTM and GRU. Experimental results demonstrate that the PESQ, SSNR and STOI of the proposed algorithm are improved by 30.72%, 39.84% and 5.53%, respectively, compared with the noise signal under the condition of matched noise. Under the condition of unmatched noise, the PESQ and STOI of the algorithm are improved by 23.8% and 37.36%, respectively. The advantage of the proposed method is that it uses the key information of features to suppress noise in both matched and unmatched noise cases and the proposed method outperforms other common methods in speech enhancement.
引用
收藏
相关论文
共 50 条
  • [1] Speech enhancement from fused features based on deep neural network and gated recurrent unit network
    Wang, Youming
    Han, Jiali
    Zhang, Tianqi
    Qing, Didi
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2021, 2021 (01)
  • [2] Speech enhancement method based on convolutional gated recurrent neural network
    Yuan W.
    Lou Y.
    Xia B.
    Sun W.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2019, 47 (04): : 13 - 18
  • [3] QUESTION DETECTION FROM ACOUSTIC FEATURES USING RECURRENT NEURAL NETWORK WITH GATED RECURRENT UNIT
    Tang, Yaodong
    Huang, Yuchen
    Wu, Zhiyong
    Meng, Helen
    Xu, Mingxing
    Cai, Lianhong
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6125 - 6129
  • [4] Speech enhancement based on simple recurrent unit network
    Cui, Xingyue
    Chen, Zhe
    Yin, Fuliang
    APPLIED ACOUSTICS, 2020, 157
  • [5] A Convolutional Gated Recurrent Network for Speech Enhancement
    Yuan W.-H.
    Hu S.-D.
    Shi Y.-L.
    Li Z.
    Liang C.-Y.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (07): : 1276 - 1283
  • [6] Speech Enhancement based on Deep Convolutional Neural Network
    Nuthakki, Ramesh
    Masanta, Payel
    Yukta, T. N.
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 770 - 775
  • [7] Supervised speech enhancement based on deep neural network
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Qazi, Abdul Baser
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5187 - 5201
  • [8] Fault diagnosis of rolling bearing based on deep convolutional neural network and gated recurrent unit
    Zhou, Zhexin
    Wang, Hao
    LI, Zhuoxian
    Chen, Wei
    JOURNAL OF ADVANCED MECHANICAL DESIGN SYSTEMS AND MANUFACTURING, 2023, 17 (02)
  • [9] A new method for the prediction of network security situations based on recurrent neural network with gated recurrent unit
    Feng, Wei
    Wu, Yuqin
    Fan, Yexian
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2020, 13 (01) : 25 - 39
  • [10] RESIDUAL RECURRENT NEURAL NETWORK FOR SPEECH ENHANCEMENT
    Abdulbaqi, Jalal
    Gu, Yue
    Chen, Shuhong
    Marsic, Ivan
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6659 - 6663