Prediction of NMF-based Wiener Filter for Speech Enhancement Using Deep Neural Networks

被引:0
|
作者
Bai, Zhigang [1 ]
Bao, Changchun [1 ]
Cui, Zihao [1 ]
机构
[1] Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020) | 2020年
基金
中国国家自然科学基金;
关键词
speech enhancement; nonnegative matrix factorization; deep neural networks; NMF-based Wiener filter;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a novel approach is presented to predict a training target called NMF-based Wiener filter using deep neural networks (DNN) in the nonnegative matrix factorization (NMF) based speech enhancement. The NMF-based Wiener filter, as a masking-based target, is easier than the encoding vectors used in previous algorithms for parameter estimation. The intermediate error of the NMF-based speech enhancement process was reduced due to direct prediction of the NMF-based Wiener filter. The encoding vectors of noisy speech were extracted with the NMF algorithm and normalized to obtain more discriminative input features. The DNN was trained to learn a nonlinear mapping from the encoding vector of noisy speech to the NMF-based Wiener filter. At test stage, the predicted NMF-based Wiener filter was used to enhance noisy speech. The objective evaluations demonstrated that the proposed algorithm outperforms some existing NMF-based and DNN-based methods at various input signal-to-noise ratio (SNR) levels.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
    Garg, Anil
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3647 - 3675
  • [42] Supervised speech enhancement based on deep neural network
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Qazi, Abdul Baser
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5187 - 5201
  • [43] Deep Neural Networks for Speech Enhancement in Complex-Noisy Environments
    Saleem, Nasir
    Khattak, Muhammad Irfan
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2020, 6 (01): : 84 - 90
  • [44] Initialization for NMF-Based Audio Source Separation Using Priors on Encoding Vectors
    Jaeuk Byun
    Jong Won Shin
    中国通信, 2019, 16 (09) : 177 - 186
  • [45] COMPRESSING DEEP NEURAL NETWORKS FOR EFFICIENT SPEECH ENHANCEMENT
    Tan, Ke
    Wang, DeLiang
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8358 - 8362
  • [46] SPEECH ENHANCEMENT FOR HEARING-IMPAIRED LISTENERS USING DEEP NEURAL NETWORKS WITH AUDITORY-MODEL BASED FEATURES
    Goehring, Tobias
    Yang, Xin
    Monaghan, Jessica J. M.
    Bleeck, Stefan
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 2300 - 2304
  • [47] A UNIFIED SPEAKER-DEPENDENT SPEECH SEPARATION AND ENHANCEMENT SYSTEM BASED ON DEEP NEURAL NETWORKS
    Gao, Tian
    Du, Jun
    Xu, Li
    Liu, Cong
    Dai, Li-Rong
    Lee, Chin-Hui
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 687 - 691
  • [48] Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks
    Kumar, Anurag
    Florencio, Dinei
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3738 - 3742
  • [49] Comparison of discrete transforms for deep-neural-networks-based speech enhancement
    Jassim, Wissam A.
    Harte, Naomi
    IET SIGNAL PROCESSING, 2022, 16 (04) : 438 - 448
  • [50] Regularized sparse features for noisy speech enhancement using deep neural networks
    Khattak, Muhammad Irfan
    Saleem, Nasir
    Gao, Jiechao
    Verdu, Elena
    Fuente, Javier Parra
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100