HMM-Based strategies for enhancement of speech signals embedded in nonstationary noise

Cited by: 125

Authors:

Sameti, H [1]

Sheikhzadeh, H [1]

Deng, L [1]

Brennan, RL [1]

Affiliations:

[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada

Source:

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1998 / Vol. 6 / No. 5

Funding:

Natural Sciences and Engineering Research Council of Canada

DOI:

10.1109/89.709670

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

An improved hidden Markov model-based (HMM-based) speech enhancement system designed using the minimum mean square error principle is implemented and compared with a conventional spectral subtraction system. The improvements to the system are: 1) incorporation of mixture components in the HMM for noise in order to handle noise nonstationarity in a more flexible manner, 2) two efficient methods in the speech enhancement system design that make the system real-time implementable, and 3) an adaptation method to the noise type in order to accommodate a wide variety of noises expected under the enhancement system's operating environment. The results of the experiments designed to evaluate the performance of the HMM-based speech enhancement systems in comparison with spectral subtraction are reported. Three types of noise were used to corrupt the test speech signals: white noise, simulated helicopter noise, and multitalker (cocktail party) noise. Both objective (global SNR) and subjective mean opinion score (MOS) evaluations demonstrate consistent superiority of the HMM-based enhancement systems that incorporate the innovations described in this paper over the conventional spectral subtraction method.

Pages: 445-455

Page count: 11

References (25 in total):

[1] BOLL, SF. SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): 113-120

[2] BOLL, SF; PULSIPHER, DC. SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING 2 MICROPHONE ADAPTIVE NOISE CANCELLATION [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (06): 752-753

[3] BOLL SF, 1991, ADV SPEECH SIGNAL PR, P309

[4] BUZO, A; GRAY, AH; GRAY, RM; MARKEL, JD. SPEECH CODING BASED UPON VECTOR QUANTIZATION [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (05): 562-574

[5] EPHRAIM, Y; MALAH, D; JUANG, BH. ON THE APPLICATION OF HIDDEN MARKOV-MODELS FOR ENHANCING NOISY SPEECH [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12): 1846-1856

[6] EPHRAIM Y, 1990, P ICASSP, P829

[7] FRAZIER RH, 1976, P IEEE ICASSP, P251

[8] Gersho A., 1992, VECTOR QUANTIZATION

[9] Gray R. M., 1993, TOEPLITZ CIRCULANT M

[10] GRAY, RM; BUZO, A; GRAY, AH; MATSUYAMA, Y. DISTORTION MEASURES FOR SPEECH PROCESSING [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04): 367-376
