Suppression of additive noise using a power spectral density MMSE estimator

被引:10
作者
Ding, GH [1 ]
Huang, T
Xu, B
机构
[1] Chinese Acad Sci, Inst Automat, High Tech Innovat Ctr, Beijing 100080, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100080, Peoples R China
关键词
exponential distribution; Gaussian distribution; minimum mean-square error (MMSE); power spectral density (PSD);
D O I
10.1109/LSP.2004.826660
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we propose a novel speech enhancement approach, called power spectral density minimum mean-square error (PSD-MMSE) estimation-based speech enhancement, which is implemented in the power spectral domain where stationary stochastic noise can be modeled as the exponential distribution. Speech magnitude-squared spectra are modeled as the mixed exponential distribution. And an MMSE estimator is constructed based on the parametric distributions. Besides, a fast algorithm is presented to implement the approach in real time. Experimental results of Itakura-Saito distortion measures show that the proposed approach is superior to alternative speech enhancement algorithms.
引用
收藏
页码:585 / 588
页数:4
相关论文
共 10 条
[1]  
ACCARDI AJ, 1909, P ICASSP
[2]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[3]  
BREITHAUPT C, 2003, P ICASSP
[4]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445
[5]   A BAYESIAN-ESTIMATION APPROACH FOR SPEECH ENHANCEMENT USING HIDDEN MARKOV-MODELS [J].
EPHRAIM, Y .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (04) :725-735
[6]   STATISTICAL-MODEL-BASED SPEECH ENHANCEMENT SYSTEMS [J].
EPHRAIM, Y .
PROCEEDINGS OF THE IEEE, 1992, 80 (10) :1526-1555
[7]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[8]   SIGNAL ESTIMATION FROM MODIFIED SHORT-TIME FOURIER-TRANSFORM [J].
GRIFFIN, DW ;
LIM, JS .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (02) :236-243
[9]   Noise power spectral density estimation based on optimal smoothing and minimum statistics [J].
Martin, R .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05) :504-512
[10]   Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises [J].
Zhao, YX .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (03) :255-266