A FAMILY OF BAYESIAN STSA ESTIMATORS FOR THE ENHANCEMENT OF SPEECH WITH CORRELATED FREQUENCY COMPONENTS

被引:2
作者
Plourde, Eric [1 ]
Champagne, Benoit [1 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 2A7, Canada
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
Speech enhancement; Bayesian estimation; short-time spectral amplitude;
D O I
10.1109/ICASSP.2010.5495159
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In Bayesian short-time spectral amplitude (STSA) estimation for speech enhancement, the spectral components are traditionally assumed uncorrelated. However, this assumption is inexact since some correlation is present in practice. We thus investigate a multidimensional STSA estimator that assumes correlated frequency components. Since the closed-form solution of this optimum estimator is not readily available, we previously derived closed-form expressions for an upper and a lower bound on the desired estimator. In this paper, we study the proximity between the upper and the lower bounds and propose a new family of estimators that are derived from these bounds and characterized by a scalar parameter 0 <= gamma <= 1, with gamma = 0 corresponding to the lower bound and gamma = 1 to the upper bound. Experimental results show that the proposed estimators achieve a better performance than existing estimators, especially at high SNR.
引用
收藏
页码:4766 / 4769
页数:4
相关论文
共 11 条
[1]  
[Anonymous], 1988, Objective measures of speech quality
[2]  
[Anonymous], 2001, Discrete-Time Speech Signal Processing:Principles and Practice
[3]  
[Anonymous], 2017, P.862.2
[4]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[5]  
Kabal P., 2005, Windows for Transform Processing
[6]   A block-based linear MMSE noise reduction with a high temporal resolution modeling of the speech excitation [J].
Li, CJ ;
Andersen, SV .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (18) :2965-2978
[7]  
O'Shaughnessy D., 2000, SPEECH COMMUN
[8]  
PLOURDE E, 2009, P 2009 IEEE WORKSH S
[9]  
*RIC U, SIGN PROC INF BAS NO
[10]  
Rudin W, 1987, REAL COMPLEX ANAL