A novel approach to a robust a Priori SNR estimator in speech enhancement

被引:15
作者
Park, Yun-Sik [1 ]
Chang, Joon-Hyuk [1 ]
机构
[1] Inha Univ, Dept Elect Engn, Inchon, South Korea
关键词
a priori SNR; decision-directed; speech enhancement; sigmoid type;
D O I
10.1093/ietcom/e90-b.8.2182
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a novel approach to single channel speech enhancement in noisy environments. Widely adopted noise reduction techniques based on the spectral subtraction are generally expressed as a spectral gain depending on the signal-to-noise ratio (SNR) [1]-[4]. As the estimation method of the SNR, the well-known decision-directed (DD) estimator of Ephraim and Malah efficiently is known to reduces musical noise in noise frames, but the a priori SNR, which is a crucial parameter of the spectral gain, follows the a posteriori SNR with a delay of one frame in speech frames [5]. Therefore, the noise suppression gain using the delayed a priori SNR, which is estimated by the DD algorithm matches the previous frame rather than the current one, so after noise suppression, this degrades the performance of a noise reduction during abrupt transient parts. To overcome this artifact, we propose a computationally simple but effective speech enhancement technique based on the sigmoid type function to adaptively determine the weighting factor of the DD algorithm. Actually, the proposed approach avoids the delay problem of the a priori SNR while maintaining the advantage of the DD algorithm. The performance of the proposed enhancement algorithm is evaluated by the objective and subjective test under various environments and yields better results compared with the conventional DD scheme based approach.
引用
收藏
页码:2182 / 2185
页数:4
相关论文
共 11 条
[1]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[2]   Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor [J].
Cappe, Olivier .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) :345-349
[3]   Speech enhancement using a noncausal a priori SNR estimator [J].
Cohen, I .
IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (09) :725-728
[4]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[5]  
Kim NS, 2000, IEEE SIGNAL PROC LET, V7, P108, DOI 10.1109/97.841154
[6]  
Ma N, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P717
[7]   SPEECH ENHANCEMENT USING A SOFT-DECISION NOISE SUPPRESSION FILTER [J].
MCAULAY, RJ ;
MALPASS, ML .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (02) :137-145
[8]  
Plapous C, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P289
[9]   A statistical model-based voice activity detection [J].
Sohn, J ;
Kim, NS ;
Sung, W .
IEEE SIGNAL PROCESSING LETTERS, 1999, 6 (01) :1-3
[10]   Single channel speech enhancement based on masking properties of the human auditory system [J].
Virag, N .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (02) :126-137