Prediction of NMF-based Wiener Filter for Speech Enhancement Using Deep Neural Networks

被引：0

作者：

Bai, Zhigang ^{[1
]}

Bao, Changchun ^{[1
]}

Cui, Zihao ^{[1
]}

机构：

[1] Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020) | 2020年

基金：

中国国家自然科学基金;

关键词：

speech enhancement; nonnegative matrix factorization; deep neural networks; NMF-based Wiener filter;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, a novel approach is presented to predict a training target called NMF-based Wiener filter using deep neural networks (DNN) in the nonnegative matrix factorization (NMF) based speech enhancement. The NMF-based Wiener filter, as a masking-based target, is easier than the encoding vectors used in previous algorithms for parameter estimation. The intermediate error of the NMF-based speech enhancement process was reduced due to direct prediction of the NMF-based Wiener filter. The encoding vectors of noisy speech were extracted with the NMF algorithm and normalized to obtain more discriminative input features. The DNN was trained to learn a nonlinear mapping from the encoding vector of noisy speech to the NMF-based Wiener filter. At test stage, the predicted NMF-based Wiener filter was used to enhance noisy speech. The objective evaluations demonstrated that the proposed algorithm outperforms some existing NMF-based and DNN-based methods at various input signal-to-noise ratio (SNR) levels.

引用

页数：5

共 50 条

[31] Fusion of Amplitude and Complex Domains based on Deep Neural Networks for Speech Enhancement
Deylami, Mohammad Saeed
Seyedin, Sanaz
2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 1890 - 1894
[32] Speech Enhancement Based on Improved Deep Neural Networks with MMSE Pretreatment Features
Han, Wei
Wu, Congming
Zhang, Xiongwei
Sun, Meng
Min, Gang
PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1140 - 1145
[33] PERCEPTUAL IMPROVEMENT OF DEEP NEURAL NETWORKS FOR MONAURAL SPEECH ENHANCEMENT
Han, Wei
Zhang, Xiongwei
Sun, Meng
Shi, Wenhua
Chen, Xushan
Hu, Yonggang
2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
[34] COMBINING SPARSE NMF WITH DEEP NEURAL NETWORK: A NEW CLASSIFICATION-BASED APPROACH FOR SPEECH ENHANCEMENT
Tseng, Hung-Wei
Hong, Mingyi
Luo, Zhi-Quan
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2145 - 2149
[35] Ideal neighbourhood mask for speech enhancement using deep neural networks
Arcos, Christian
Vellasco, Marley
Alcaim, Abraham
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[36] A novel NMF-based authentication scheme for encrypted speech in cloud computing
Shi, Canghong
Wang, Hongxia
Hu, Yi
Li, Xiaojie
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 25773 - 25798
[37] A novel NMF-based authentication scheme for encrypted speech in cloud computing
Canghong Shi
Hongxia Wang
Yi Hu
Xiaojie Li
Multimedia Tools and Applications, 2021, 80 : 25773 - 25798
[38] A Deep Neural Network Based Kalman Filter for Time Domain Speech Enhancement
Yu, Hongjiang
Ouyang, Zhiheng
Zhu, Wei-Ping
Champagne, Benoit
Ji, Yunyun
2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
[39] A NEW SPEECH ENHANCEMENT APPROACH BASED ON PROGRESSIVE DEEP NEURAL NETWORKS
Shu, Xiaofeng
Zhou, Yi
Cao, Yin
2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 191 - 195
[40] Phase-Aware Speech Enhancement Based on Deep Neural Networks
Zheng, Naijun
Zhang, Xiao-Lei
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 63 - 76

← 1 2 3 4 5 →