Prediction of NMF-based Wiener Filter for Speech Enhancement Using Deep Neural Networks

被引：0

作者：

Bai, Zhigang ^{[1
]}

Bao, Changchun ^{[1
]}

Cui, Zihao ^{[1
]}

机构：

[1] Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020) | 2020年

基金：

中国国家自然科学基金;

关键词：

speech enhancement; nonnegative matrix factorization; deep neural networks; NMF-based Wiener filter;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, a novel approach is presented to predict a training target called NMF-based Wiener filter using deep neural networks (DNN) in the nonnegative matrix factorization (NMF) based speech enhancement. The NMF-based Wiener filter, as a masking-based target, is easier than the encoding vectors used in previous algorithms for parameter estimation. The intermediate error of the NMF-based speech enhancement process was reduced due to direct prediction of the NMF-based Wiener filter. The encoding vectors of noisy speech were extracted with the NMF algorithm and normalized to obtain more discriminative input features. The DNN was trained to learn a nonlinear mapping from the encoding vector of noisy speech to the NMF-based Wiener filter. At test stage, the predicted NMF-based Wiener filter was used to enhance noisy speech. The objective evaluations demonstrated that the proposed algorithm outperforms some existing NMF-based and DNN-based methods at various input signal-to-noise ratio (SNR) levels.

引用

页数：5

共 50 条

[41] Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
Garg, Anil
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3647 - 3675
[42] Supervised speech enhancement based on deep neural network
Saleem, Nasir
Khattak, Muhammad Irfan
Qazi, Abdul Baser
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5187 - 5201
[43] Deep Neural Networks for Speech Enhancement in Complex-Noisy Environments
Saleem, Nasir
Khattak, Muhammad Irfan
INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2020, 6 (01): : 84 - 90
[44] Initialization for NMF-Based Audio Source Separation Using Priors on Encoding Vectors
Jaeuk Byun
Jong Won Shin
中国通信, 2019, 16 (09) : 177 - 186
[45] COMPRESSING DEEP NEURAL NETWORKS FOR EFFICIENT SPEECH ENHANCEMENT
Tan, Ke
Wang, DeLiang
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8358 - 8362
[46] SPEECH ENHANCEMENT FOR HEARING-IMPAIRED LISTENERS USING DEEP NEURAL NETWORKS WITH AUDITORY-MODEL BASED FEATURES
Goehring, Tobias
Yang, Xin
Monaghan, Jessica J. M.
Bleeck, Stefan
2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 2300 - 2304
[47] A UNIFIED SPEAKER-DEPENDENT SPEECH SEPARATION AND ENHANCEMENT SYSTEM BASED ON DEEP NEURAL NETWORKS
Gao, Tian
Du, Jun
Xu, Li
Liu, Cong
Dai, Li-Rong
Lee, Chin-Hui
2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 687 - 691
[48] Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks
Kumar, Anurag
Florencio, Dinei
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3738 - 3742
[49] Comparison of discrete transforms for deep-neural-networks-based speech enhancement
Jassim, Wissam A.
Harte, Naomi
IET SIGNAL PROCESSING, 2022, 16 (04) : 438 - 448
[50] Regularized sparse features for noisy speech enhancement using deep neural networks
Khattak, Muhammad Irfan
Saleem, Nasir
Gao, Jiechao
Verdu, Elena
Fuente, Javier Parra
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100

← 1 2 3 4 5 →