A NEW LINEAR MMSE FILTER FOR SINGLE CHANNEL SPEECH ENHANCEMENT BASED ON NONNEGATIVE MATRIX FACTORIZATION

被引：0

作者：

Mohammadiha, Nasser ^{[1
]}

Gerkmann, Timo ^{[1
]}

Leijon, Arne ^{[1
]}

机构：

[1] KTH Royal Inst Technol, Sound & Image Proc Lab, Stockholm, Sweden

来源：

2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2011年

关键词：

Speech enhancement; nonnegative matrix factorization; Linear MMSE filter; AUDIO SOURCE SEPARATION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, a linear MMSE filter is derived for single-channel speech enhancement which is based on Nonnegative Matrix Factorization (NMF). Assuming an additive model for the noisy observation, an estimator is obtained by minimizing the mean square error between the clean speech and the estimated speech components in the frequency domain. In addition, the noise power spectral density (PSD) is estimated using NMF and the obtained noise PSD is used in a Wiener filtering framework to enhance the noisy speech. The results of the both algorithms are compared to the result of the same Wiener filtering framework in which the noise PSD is estimated using a recently developed MMSE-based method. NMF based approaches outperform the Wiener filter with the MMSE-based noise PSD tracker for different measures. Compared to the NMF-based Wiener filtering approach, Source to Distortion Ratio (SDR) is improved for the evaluated noise types for different input SNRs using the proposed linear MMSE filter.

引用

页码：45 / 48

页数：4

共 16 条

[1] [Anonymous], 2000, NIPS
[2] CICHOCKI A, 2006, IEEE INT C ICASSP
[3] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
EPHRAIM, Y
MALAH, D
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
[4] EPHRAIM Y, 2005, RECENT ADV SPEECH EN
[5] Fevotte C., 2009, EUSIPCO
[6] Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis
Fevotte, Cedric
Bertin, Nancy
Durrieu, Jean-Louis
[J]. NEURAL COMPUTATION, 2009, 21 (03) : 793 - 830
[7] Hendriks R. C., 2010, IEEE INT C ICASSP
[8] *IT, 2000, P862 IT
[9] Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model
Lotter, T
Vary, P
[J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (07) : 1110 - 1126
[10] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
Ozerov, Alexey
Fevotte, Cedric
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563

← 1 2 →