Model-Based Speech Enhancement in the Modulation Domain

Cited by: 21
Authors
Wang, Yu [1]
Brookes, Mike [2]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[2] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
Keywords
Speech enhancement; modulation-domain Kalman filter; statistical modelling; minimum mean-square error (MMSE) estimator; SPECTRAL AMPLITUDE ESTIMATION; SQUARE ERROR ESTIMATION; NOISE; SUPPRESSION; ESTIMATORS; QUALITY;
DOI
10.1109/TASLP.2017.2786863
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents an algorithm for modulation-domain speech enhancement using a Kalman filter. The proposed estimator jointly models the estimated dynamics of the spectral amplitudes of speech and noise to obtain an MMSE estimate of the speech amplitude spectrum, under the assumption that the speech and noise are additive in the complex domain. In order to include the dynamics of noise amplitudes with those of speech amplitudes, we propose a statistical "Gaussring" model that comprises a mixture of Gaussians whose centers lie on a circle in the complex plane. The performance of the proposed algorithm is evaluated using the perceptual evaluation of speech quality measure, the segmental SNR measure, and the short-time objective intelligibility measure. For speech quality measures, the proposed algorithm is shown to give a consistent improvement over a wide range of SNRs when compared to competitive algorithms. Speech recognition experiments also show that the Gaussring-model-based algorithm performs well for two types of noise.
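The "Gaussring" idea in the abstract (a mixture of Gaussians whose centers lie on a circle in the complex plane) can be illustrated with a minimal sketch. The abstract does not give the paper's parameterization, so everything specific here is an assumption made for illustration: equally spaced centers, equal mixture weights, a single isotropic variance, and the hypothetical function names `gaussring_centers` and `sample_gaussring`.

```python
import numpy as np


def gaussring_centers(radius, n_components):
    """Place mixture centers at equally spaced angles on a circle (assumed layout)."""
    angles = 2.0 * np.pi * np.arange(n_components) / n_components
    return radius * np.exp(1j * angles)


def sample_gaussring(radius, sigma, n_components, n_samples, rng):
    """Draw complex samples from an equal-weight mixture of isotropic Gaussians.

    Assumption: every component shares the same total variance sigma**2,
    split evenly between the real and imaginary parts.
    """
    centers = gaussring_centers(radius, n_components)
    # Pick one mixture component per sample with equal probability.
    idx = rng.integers(0, n_components, size=n_samples)
    noise = sigma * (
        rng.standard_normal(n_samples) + 1j * rng.standard_normal(n_samples)
    ) / np.sqrt(2.0)
    return centers[idx] + noise


rng = np.random.default_rng(0)
samples = sample_gaussring(
    radius=1.0, sigma=0.1, n_components=8, n_samples=10000, rng=rng
)
# With a small sigma, the amplitudes concentrate near the ring radius
# while the phase stays spread around the circle.
mean_amplitude = float(np.abs(samples).mean())
```

In this sketch the ring radius plays the role of a known amplitude with unknown phase, which is one plausible reading of why centers are placed on a circle; the paper itself should be consulted for the actual model and its fitting procedure.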
Pages: 580-594
Number of pages: 15