Speech recognition in a noisy car environment based on LP of the one-sided autocorrelation sequence and robust similarity measuring techniques

Cited by: 23
Authors
Hernando, J
Nadeu, C
Marino, JB
Affiliation
[1] Dept. of Sign. Theor. and Commun., Polytech. University of Catalonia, Barcelona
Keywords
speech recognition; noise robustness; feature extraction; spectral analysis of speech; distortion measures; vector quantization
DOI
10.1016/S0167-6393(96)00074-X
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
The performance of existing speech recognition systems degrades rapidly in the presence of background noise. A novel representation of the speech signal, based on Linear Prediction of the One-Sided Autocorrelation sequence (OSALPC), has been shown to be attractive for noisy speech recognition, both because it outperforms conventional LPC in severe conditions of additive white noise and because it is computationally simple. The aim of this work is twofold: (1) to show that OSALPC also performs well on real noisy speech (in a car environment), and (2) to explore its combination with several robust similarity measuring techniques, showing that its performance improves with cepstral liftering, dynamic features and multilabeling.
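The core idea named in the abstract, linear prediction applied to the one-sided autocorrelation sequence rather than to the waveform itself, can be sketched roughly as follows. This is an illustrative reading of the abstract, not the authors' implementation: the function names (`autocorr`, `levinson_durbin`, `osalpc`), the biased autocorrelation estimator, the number of lags `n_lags`, and the absence of any windowing of the autocorrelation sequence are all assumptions made for the example.

```python
import numpy as np

def autocorr(x, max_lag):
    """Biased autocorrelation estimate of x for lags 0..max_lag."""
    n = len(x)
    return np.array([np.dot(x[:n - k], x[k:]) / n for k in range(max_lag + 1)])

def levinson_durbin(r, order):
    """Solve the LP normal equations for autocorrelation r.

    Returns (a, err): predictor polynomial with a[0] == 1 and the
    final prediction error.
    """
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                      # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def osalpc(frame, order, n_lags):
    """Assumed OSALPC sketch: LP analysis of the one-sided autocorrelation.

    Step 1: estimate the autocorrelation of the frame for lags 0..n_lags
            (the causal, one-sided part of the sequence).
    Step 2: treat that sequence as a signal and run conventional
            autocorrelation-method LPC on it.
    """
    osa = autocorr(frame, n_lags)           # one-sided autocorrelation sequence
    r_osa = autocorr(osa, order)            # autocorrelation of that sequence
    return levinson_durbin(r_osa, order)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noise = rng.standard_normal(400)
    frame = np.zeros(400)
    for n in range(2, 400):                 # synthetic AR(2) test signal
        frame[n] = 1.3 * frame[n - 1] - 0.6 * frame[n - 2] + noise[n]
    a, err = osalpc(frame, order=2, n_lags=64)
    print("predictor coefficients:", a)
```

The resulting coefficients could then be converted to cepstral coefficients and liftered, as the abstract mentions, but that step is left out here because the record gives no details about it.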
Pages: 17-31
Page count: 15
Related papers
25 in total
[1]   ROOT CEPSTRAL ANALYSIS - A UNIFIED VIEW - APPLICATION TO SPEECH PROCESSING IN CAR NOISE ENVIRONMENTS [J].
ALEXANDRE, P ;
LOCKWOOD, P .
SPEECH COMMUNICATION, 1993, 12 (03) :277-288
[2]   SPECTRAL ESTIMATION - AN OVERDETERMINED RATIONAL MODEL EQUATION APPROACH [J].
CADZOW, JA .
PROCEEDINGS OF THE IEEE, 1982, 70 (09) :907-939
[3]   SPEAKER-INDEPENDENT ISOLATED WORD RECOGNITION USING DYNAMIC FEATURES OF SPEECH SPECTRUM [J].
FURUI, S .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (01) :52-59
[4]   SPECTRAL SLOPE DISTANCE MEASURES WITH LINEAR PREDICTION ANALYSIS FOR WORD RECOGNITION IN NOISE [J].
HANSON, BA ;
WAKITA, H .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1987, 35 (07) :968-973
[5]  
HERNANDO J, 1994, P INT C AC SPEECH SI, V2, P69
[6]  
HERNANDO J, 1993, THESIS POLYTECHNICAL
[7]  
HERNANDO J, 1992, P ICSLP 92 BANFF OCT, P1593
[8]  
HERNANDO J, 1991, P EUR 91 SEPT 1991 G, P91
[9]  
HERNANDO J, 1993, P EUR 93 BERL SEPT 1, P1643
[10]   PHONEME CLASSIFICATION USING SEMICONTINUOUS HIDDEN MARKOV-MODELS [J].
HUANG, XD .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (05) :1062-1067