Auditory-Based Spectral Amplitude Estimators for Speech Enhancement

被引:42
作者
Plourde, Eric [1 ]
Champagne, Benoit [1 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 2A7, Canada
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2008年 / 16卷 / 08期
基金
加拿大自然科学与工程研究理事会;
关键词
Bayesian estimators; human auditory system; short-time spectral amplitude; speech enhancement;
D O I
10.1109/TASL.2008.2004304
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose a new family of Bayesian estimators for speech enhancement where the cost function includes both a power law and a weighting factor. The parameters of the cost function, and therefore of the corresponding estimator gain, are chosen based on characteristics of the human auditory system, namely, the compressive nonlinearities of the cochlea, the perceived loudness and the ear's masking properties. It is found that choosing the parameters in this way results in a decrease of the estimator gain at high frequencies. This frequency dependence of the gain improves the noise reduction while limiting the speech distortion. Experimental results show that the new estimators achieve better enhancement performance than existing Bayesian estimators such as those based on the minimum mean-square error (MMSE) of the short-time spectral amplitude (STSA), the MMSE of the logarithm of the STSA (LSA) or the weighted euclidien (WE) error, both in terms of objective and subjective measures.
引用
收藏
页码:1614 / 1623
页数:10
相关论文
共 43 条