Speech enhancement using the modified phase-opponency model

Cited by: 8
Authors
Deshmukh, Om D. [1]
Espy-Wilson, Carol Y.
Carney, Laurel H.
Affiliations
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, Syst Res Inst, College Pk, MD 20742 USA
[3] Syracuse Univ, Dept Biomed & Chem Engn, Syracuse, NY 13244 USA
[4] Syracuse Univ, Inst Sensory Res, Syracuse, NY 13244 USA
DOI
10.1121/1.2714913
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper we present a model called the Modified Phase-Opponency (MPO) model for single-channel speech enhancement when the speech is corrupted by additive noise. The MPO model is based on the auditory PO model, proposed for detection of tones in noise. The PO model includes a physiologically realistic mechanism for processing the information in neural discharge times and exploits the frequency-dependent phase properties of the tuned filters in the auditory periphery by using a cross-auditory-nerve-fiber coincidence detection for extracting temporal cues. The MPO model alters the components of the PO model such that the basic functionality of the PO model is maintained but the properties of the model can be analyzed and modified independently. The MPO-based speech enhancement scheme does not need to estimate the noise characteristics, nor does it assume that the noise satisfies any statistical model. The MPO technique leads to the lowest value of the LPC-based objective measures and the highest value of the perceptual evaluation of speech quality measure compared to other methods when the speech signals are corrupted by fluctuating noise. Combining the MPO speech enhancement technique with our aperiodicity, periodicity, and pitch detector further improves its performance. © 2007 Acoustical Society of America.
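The phase-opponency idea in the abstract can be illustrated with a toy sketch. This is not the authors' filter design: the Gaussian magnitude response, the phase profile, the bandwidth values, and the name `po_output` are all assumptions chosen for illustration. Two paths share the same band-pass magnitude but are out of phase by about 180 degrees in a narrow region around the target frequency, and the time-averaged product of their outputs stands in for the coincidence detector.

```python
import numpy as np

def po_output(x, fs, f0, mag_bw=200.0, phase_bw=30.0):
    """Toy phase-opponency detector (hypothetical filter shapes, not the paper's)."""
    n = len(x)
    f = np.fft.rfftfreq(n, d=1.0 / fs)
    # Shared band-pass magnitude centred on f0 (assumed Gaussian shape).
    mag = np.exp(-0.5 * ((f - f0) / mag_bw) ** 2)
    # Relative phase between the two paths: pi at f0, decaying to 0 away from it.
    phase = np.pi * np.exp(-0.5 * ((f - f0) / phase_bw) ** 2)
    X = np.fft.rfft(x)
    y1 = np.fft.irfft(X * mag, n=n)
    y2 = np.fft.irfft(X * mag * np.exp(1j * phase), n=n)
    # Time-averaged product as a stand-in for coincidence detection.
    return float(np.mean(y1 * y2))

fs = 8000                      # sampling rate in Hz (assumed)
n = fs                         # one second of signal
t = np.arange(n) / fs
rng = np.random.default_rng(0)
noise = rng.standard_normal(n)
tone = np.cos(2 * np.pi * 1000.0 * t)

out_noise = po_output(noise, fs, 1000.0)
out_tone = po_output(tone, fs, 1000.0)
out_mix = po_output(tone + noise, fs, 1000.0)
print(out_noise, out_tone, out_mix)
```

In this toy setup, a narrowband component at the target frequency drives the two paths out of phase, so the averaged product is negative. Broadband noise mostly falls where the paths are in phase, so the product is positive. No noise estimate is used at any point, which is the property the abstract emphasizes.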
Pages: 3886-3898
Page count: 13