Prediction of NMF-based Wiener Filter for Speech Enhancement Using Deep Neural Networks

被引:0
|
作者
Bai, Zhigang [1 ]
Bao, Changchun [1 ]
Cui, Zihao [1 ]
机构
[1] Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020) | 2020年
基金
中国国家自然科学基金;
关键词
speech enhancement; nonnegative matrix factorization; deep neural networks; NMF-based Wiener filter;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a novel approach is presented to predict a training target called NMF-based Wiener filter using deep neural networks (DNN) in the nonnegative matrix factorization (NMF) based speech enhancement. The NMF-based Wiener filter, as a masking-based target, is easier than the encoding vectors used in previous algorithms for parameter estimation. The intermediate error of the NMF-based speech enhancement process was reduced due to direct prediction of the NMF-based Wiener filter. The encoding vectors of noisy speech were extracted with the NMF algorithm and normalized to obtain more discriminative input features. The DNN was trained to learn a nonlinear mapping from the encoding vector of noisy speech to the NMF-based Wiener filter. At test stage, the predicted NMF-based Wiener filter was used to enhance noisy speech. The objective evaluations demonstrated that the proposed algorithm outperforms some existing NMF-based and DNN-based methods at various input signal-to-noise ratio (SNR) levels.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Fusion of Amplitude and Complex Domains based on Deep Neural Networks for Speech Enhancement
    Deylami, Mohammad Saeed
    Seyedin, Sanaz
    2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 1890 - 1894
  • [32] Speech Enhancement Based on Improved Deep Neural Networks with MMSE Pretreatment Features
    Han, Wei
    Wu, Congming
    Zhang, Xiongwei
    Sun, Meng
    Min, Gang
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1140 - 1145
  • [33] PERCEPTUAL IMPROVEMENT OF DEEP NEURAL NETWORKS FOR MONAURAL SPEECH ENHANCEMENT
    Han, Wei
    Zhang, Xiongwei
    Sun, Meng
    Shi, Wenhua
    Chen, Xushan
    Hu, Yonggang
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [34] COMBINING SPARSE NMF WITH DEEP NEURAL NETWORK: A NEW CLASSIFICATION-BASED APPROACH FOR SPEECH ENHANCEMENT
    Tseng, Hung-Wei
    Hong, Mingyi
    Luo, Zhi-Quan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2145 - 2149
  • [35] Ideal neighbourhood mask for speech enhancement using deep neural networks
    Arcos, Christian
    Vellasco, Marley
    Alcaim, Abraham
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [36] A novel NMF-based authentication scheme for encrypted speech in cloud computing
    Shi, Canghong
    Wang, Hongxia
    Hu, Yi
    Li, Xiaojie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 25773 - 25798
  • [37] A novel NMF-based authentication scheme for encrypted speech in cloud computing
    Canghong Shi
    Hongxia Wang
    Yi Hu
    Xiaojie Li
    Multimedia Tools and Applications, 2021, 80 : 25773 - 25798
  • [38] A Deep Neural Network Based Kalman Filter for Time Domain Speech Enhancement
    Yu, Hongjiang
    Ouyang, Zhiheng
    Zhu, Wei-Ping
    Champagne, Benoit
    Ji, Yunyun
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [39] A NEW SPEECH ENHANCEMENT APPROACH BASED ON PROGRESSIVE DEEP NEURAL NETWORKS
    Shu, Xiaofeng
    Zhou, Yi
    Cao, Yin
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 191 - 195
  • [40] Phase-Aware Speech Enhancement Based on Deep Neural Networks
    Zheng, Naijun
    Zhang, Xiao-Lei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 63 - 76