A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing

Cited by: 96
Authors
Breithaupt, Colin [1]
Gerkmann, Timo [1]
Martin, Rainer [1]
Affiliations
[1] Ruhr Univ Bochum, Inst Commun Acoust IKA, D-44780 Bochum, Germany
Source
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008
Keywords
speech enhancement; decision-directed approach; SNR estimation; musical noise; cepstral analysis;
DOI
10.1109/ICASSP.2008.4518755
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
While state-of-the-art approaches obtain an estimate of the a priori SNR by adaptively smoothing its maximum likelihood estimate in the frequency domain, we selectively smooth the maximum likelihood estimate in the cepstral domain. In the cepstral domain, the noisy speech signal is decomposed into coefficients related mainly to the speech envelope, the excitation, and noise. Because the coefficients that represent speech can be robustly determined in the cepstral domain, we can apply little smoothing to speech coefficients and strong smoothing to noise coefficients. Thus, speech components are preserved and musical noise is suppressed. In speech enhancement experiments, we obtain consistent improvements over the well-known decision-directed approach.
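The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`ml_snr`, `make_alpha`, `smooth_frame`), the smoothing constants, the pitch threshold, and the quefrency range (chosen with a 16 kHz sampling rate and a 512-point DFT in mind) are all assumptions made for illustration. Any bias introduced by averaging in the log domain is not compensated here.

```python
import numpy as np

def ml_snr(noisy_power, noise_power, xi_min=10 ** (-25 / 10)):
    """Maximum likelihood a priori SNR estimate, floored at xi_min."""
    return np.maximum(noisy_power / noise_power - 1.0, xi_min)

def make_alpha(ceps, n_env=4, q_range=(32, 160), pitch_thresh=0.2,
               a_env=0.2, a_pitch=0.2, a_noise=0.9):
    """Per-quefrency smoothing constants (all default values are assumptions).

    Low quefrencies (spectral envelope) and a detected pitch peak get
    little smoothing; every other coefficient gets strong smoothing.
    """
    K = ceps.size
    alpha = np.full(K, a_noise)
    alpha[:n_env] = a_env
    q_lo, q_hi = q_range
    q_pitch = q_lo + int(np.argmax(ceps[q_lo:q_hi]))
    if ceps[q_pitch] > pitch_thresh:
        alpha[q_pitch - 1:q_pitch + 2] = a_pitch
    # Mirror onto the upper half so the smoothed cepstrum stays symmetric.
    alpha[K // 2 + 1:] = alpha[1:K // 2][::-1]
    return alpha

def smooth_frame(xi_ml, prev_ceps, xi_min=10 ** (-25 / 10)):
    """Smooth one frame of the ML SNR estimate in the cepstral domain.

    xi_ml:     ML estimate on the K/2+1 non-negative frequency bins
    prev_ceps: smoothed cepstrum of the previous frame, or None
    Returns the smoothed SNR estimate and the new cepstral state.
    """
    log_xi = np.log(np.maximum(xi_ml, xi_min))
    ceps = np.fft.irfft(log_xi)            # real, symmetric, length K
    alpha = make_alpha(ceps)
    if prev_ceps is None:                  # first frame: nothing to smooth with
        prev_ceps = ceps
    ceps_s = alpha * prev_ceps + (1.0 - alpha) * ceps
    xi = np.exp(np.fft.rfft(ceps_s).real)  # back to the frequency domain
    return xi, ceps_s
```

A caller would keep the returned cepstral state between frames, for example `xi, state = smooth_frame(ml_snr(Y2, N2), state)` with `state = None` before the first frame, where `Y2` and `N2` stand for the noisy periodogram and a noise power estimate.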
Pages: 4897-4900
Page count: 4