Speech enhancement based on Bayesian decision and spectral amplitude estimation

Cited by: 3

Authors:

Deng, Feng [1]

Bao, Chang-Chun [1]

Affiliation:

[1] Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Source:

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING | 2015

Funding:

National Natural Science Foundation of China;

Keywords:

Speech enhancement; Bayesian decision; Spectral amplitude estimation; Combined Bayesian risk function; General weighted cost function; NOISE;

DOI:

10.1186/s13636-015-0073-6

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

This paper proposes a single-channel speech enhancement method based on Bayesian decision and spectral amplitude estimation. The method comprises a speech detection module and a spectral amplitude estimation module, and the two modules are strongly coupled. First, the optimal speech amplitude estimators under the speech-presence and speech-absence decisions are obtained by minimizing a combined Bayesian risk function. Second, using these spectral amplitude estimators, the optimal speech detector is obtained by further minimizing the combined Bayesian risk function. Finally, according to the detection result of the speech detector, the optimal decision rule is applied and the corresponding optimal spectral amplitude estimator is chosen to enhance the noisy speech. Furthermore, to account for both detection and estimation errors, we propose a combined cost function that incorporates two general weighted distortion measures of the spectral amplitudes, one for speech presence and one for speech absence. The cost parameters in the cost function balance the speech distortion caused by missed detection against the residual noise caused by false alarm. In addition, we propose two adaptive calculation methods, one for the perceptual weighted order p and one for the spectral amplitude order β used in the proposed cost function. Objective and subjective test results indicate that the proposed method achieves a larger segmental signal-to-noise ratio (SNR) improvement, a lower log-spectral distortion, and better speech quality than the reference methods.
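The segmental SNR improvement reported in the abstract can be illustrated with a minimal sketch. It assumes the common definition of segmental SNR as the mean of per-frame SNRs in dB, with each frame clamped to a fixed range. The frame length (256 samples), the clamp limits (-10 dB and 35 dB), and the function names are illustrative assumptions, not values or code taken from the paper.

```python
import math


def segmental_snr(clean, processed, frame_len=256, lo_db=-10.0, hi_db=35.0):
    """Mean per-frame SNR in dB, each frame clamped to [lo_db, hi_db].

    frame_len, lo_db and hi_db are assumed defaults, not the paper's settings.
    """
    n = min(len(clean), len(processed))
    frame_snrs = []
    # Non-overlapping frames; a trailing partial frame is ignored.
    for start in range(0, n - frame_len + 1, frame_len):
        s = clean[start:start + frame_len]
        y = processed[start:start + frame_len]
        signal_energy = sum(a * a for a in s)
        error_energy = sum((a - b) ** 2 for a, b in zip(s, y))
        if signal_energy == 0.0:
            continue  # skip frames with no clean-signal energy
        if error_energy == 0.0:
            snr_db = hi_db  # perfect reconstruction: use the upper clamp
        else:
            snr_db = 10.0 * math.log10(signal_energy / error_energy)
        frame_snrs.append(min(max(snr_db, lo_db), hi_db))
    if not frame_snrs:
        raise ValueError("no usable frames")
    return sum(frame_snrs) / len(frame_snrs)


def segmental_snr_improvement(clean, noisy, enhanced, **kwargs):
    """Segmental SNR of the enhanced signal minus that of the noisy input."""
    return (segmental_snr(clean, enhanced, **kwargs)
            - segmental_snr(clean, noisy, **kwargs))
```

A positive return value from `segmental_snr_improvement` means the enhanced signal is closer to the clean reference than the noisy input was, frame by frame. Clamping keeps silent or near-perfect frames from dominating the average.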

References

Pages: 18

29 entries in total

[1] Simultaneous detection and estimation approach for speech enhancement
Abramson, Ari
Cohen, Israel
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08) : 2348 - 2359
[2] [Anonymous], 2001, REC P 862 PERC EV SP
[3] [Anonymous], 2014, Table of Integrals, Series, and Products
[4] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
BOLL, SF
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) : 113 - 120
[5] Noise estimation by minima controlled recursive averaging for robust speech enhancement
Cohen, I
Berdugo, B
[J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15
[6] Speech enhancement for non-stationary noise environments
Cohen, I
Berdugo, B
[J]. SIGNAL PROCESSING, 2001, 81 (11) : 2403 - 2418
[7] Deng F, 2013, INTERSPEECH, P3233
[8] Speech enhancement using generalized weighted β-order spectral amplitude estimator
Deng, Feng
Bao, Feng
Bao, Chang-chun
[J]. SPEECH COMMUNICATION, 2014, 59 : 55 - 68
[9] DE-NOISING BY SOFT-THRESHOLDING
DONOHO, DL
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1995, 41 (03) : 613 - 627
[10] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR
EPHRAIM, Y
MALAH, D
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) : 443 - 445
