A two-channel minimum mean-square error log-spectral amplitude estimator for speech enhancement

被引:0
作者
Choi, Min-Seok [1 ]
Kang, Hong-Goo [1 ]
机构
[1] Yonsei Univ, Seoul, South Korea
来源
2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS | 2008年
关键词
speech enhancement; two-channel speech enhancement; noise power spectral density estimation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a novel two-channel speech enhancement structure using the minimum mean-square error log-spectral amplitude (MMSE-LSA) estimator. The proposed two-channel enhancement algorithm utilizes a spatial relationship between two input signals to accurately estimate the noise power spectral density (PSD) needed for the MMSE-LSA algorithm. The proposed structure improves the noise reduction capacity with less speech distortion, while its complexity is much lower than simple cascade structures. The performance of the proposed algorithm is evaluated by automatic speech recognition tests in a car environment. Comparing to a simple cascading of two- and single-channel algorithms, the proposed algorithm improves the relative recognition rate by 17.5 % for high speed conditions and 14.8 % for low speed conditions, respectively.
引用
收藏
页码:153 / 156
页数:4
相关论文
共 9 条
[1]  
Brandstein M, 2001, DIGITAL SIGNAL PROC, P133
[2]   FREQUENCY-DOMAIN IMPLEMENTATION OF GRIFFITHS-JIM ADAPTIVE BEAMFORMER [J].
CHEN, YH ;
FANG, HD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 91 (06) :3354-3366
[3]  
Choi MS, 2007, INT CONF ACOUST SPEE, P893
[4]   Analysis of two-channel generalized sidelobe canceller (GSC) with post-filtering [J].
Cohen, I .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (06) :684-699
[5]   Speech enhancement for non-stationary noise environments [J].
Cohen, I ;
Berdugo, B .
SIGNAL PROCESSING, 2001, 81 (11) :2403-2418
[6]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445
[7]   CONSTRAINED ITERATIVE SPEECH ENHANCEMENT WITH APPLICATION TO SPEECH RECOGNITION [J].
HANSEN, JHL ;
CLEMENTS, MA .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) :795-805
[8]   A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters [J].
Hoshuyama, O ;
Sugiyama, A ;
Hirano, A .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1999, 47 (10) :2677-2684
[9]   Noise power spectral density estimation based on optimal smoothing and minimum statistics [J].
Martin, R .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05) :504-512