Beta-order minimum mean-square error multichannel spectral amplitude estimation for speech enhancement

被引:2
作者
Trawicki, M. B. [1 ]
Johnson, M. T. [1 ]
机构
[1] Marquette Univ, Dept Elect & Comp Engn, Speech & Signal Proc Lab, Milwaukee, WI 53201 USA
关键词
acoustic arrays; speech enhancement; parameter estimation; NOISE; GAMMA;
D O I
10.1002/acs.2534
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the minimum mean-square error (MMSE) -order estimator for multichannel speech enhancement is proposed. The estimator is an extension of the single-channel MMSE -order and multichannel MMSE short-time spectral amplitude estimators using Rayleigh and Gaussian distributions for the statistical models under the assumption of a diffuse noise field where the noise is estimated independently across each of the microphones. Experiments are performed to evaluate the new estimator against the baseline single-channel and multichannel estimators using various values of the parameter and number of microphones along with different levels of noises as a function of the input signal-to-noise ratio. By the utilization of additional microphones, the multichannel MMSE -order estimator achieves performance gains in noise reduction, speech distortion, and speech quality as measured by the segmental signal-to-noise ratio, log-likelihood ratio, and perceptual evaluation of speech quality objective metrics. Copyright (c) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:1287 / 1295
页数:9
相关论文
共 28 条
[1]   Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors [J].
Andrianakis, I. ;
White, P. R. .
SPEECH COMMUNICATION, 2009, 51 (01) :1-14
[2]  
[Anonymous], 2001, PERC EV SPEECH QUAL
[3]  
[Anonymous], 2000, Tables of Integrals, Series, and Products
[4]  
Brandstein M, 2001, DIGITAL SIGNAL PROC, P133
[5]   Parameterized MMSE spectral magnitude estimation for the enhancement of noisy speech [J].
Breithaupt, Colin ;
Krawczyk, Martin ;
Martin, Rainer .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :4037-4040
[6]   Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor [J].
Cappe, Olivier .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) :345-349
[7]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445
[8]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[9]   Minimum mean-square error estimation of discrete fourier coefficients with generalized gamma priors [J].
Erkelens, Jan S. ;
Hendriks, Richard C. ;
Heusdens, Richard ;
Jensen, Jesper .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (06) :1741-1752
[10]  
Garofolo S., 1993, Timit acousticphonetic continuous speech corpus