Towards improving speech detection robustness for speech recognition in adverse conditions

Cited by: 42
Authors
Karray, L
Martin, A
Affiliations
[1] IPS, DIH, FTR&D, F-22307 Lannion, France
[2] Univ Bretagne Sud, IUT Vannes, F-56000 Vannes, France
Keywords
ALGORITHM;
DOI
10.1016/S0167-6393(02)00066-3
Chinese Library Classification number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
Recognition performance decreases when recognition systems are used over the telephone network, especially over wireless networks and in noisy environments. Inefficient speech/non-speech detection (SND) appears to be an important source of this degradation. The robustness of speech detection to noise is therefore a challenging problem that must be addressed to improve recognition performance for very noisy communications. Several studies have aimed to improve the robustness of SND for speech recognition in adverse conditions. The present paper proposes some solutions for improving SND in wireless environments. Speech enhancement prior to detection is considered. Then, two versions of an SND algorithm, based on statistical criteria, are proposed and compared. Finally, a post-detection technique is introduced to reject wrongly detected noise segments. © 2002 Elsevier Science B.V. All rights reserved.
Pages: 261-276
Page count: 16