AUTOMATIC WORD RECOGNITION IN CARS

被引:32
作者
MOKBEL, CE [1 ]
CHOLLET, GFA [1 ]
机构
[1] TELECOM PARIS,CNRS,URA 820,PARIS,FRANCE
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1995年 / 3卷 / 05期
关键词
D O I
10.1109/89.466660
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper compares, on a database recorded in a car, a number of signal analysis and speech enhancement techniques as well as some approaches to adapt speech recognition systems, It is shown that a new nonlinear spectral subtraction associated with Mel frequency cepstral coefficients (MFCC) is an adequate compromise for low-cost integration, The Lombard effect is analyzed and simulated, Such a simulation is used to derive realistic training utterances from noise-free utterances, Adapting a continuous-density hidden Markov model (CDHMM) to these artificially generated training samples yields a very high performance with respect to that achieved within the ESPRIT adverse environment recognition of speech (ARS) project, i,e,, an average of 1% error for an driving conditions, Finally, this paper shows, both theoretically and experimentally, that whatever the noise estimation technique is, it is better to add this noise estimate to the reference clean models than to subtract it from the noisy data.
引用
收藏
页码:346 / 356
页数:11
相关论文
共 48 条
[1]  
Lecomte I., Lever M., Boudy J., Tassy A., Car noise processing for speech input, Proc. ICASSP89, 1, (1989)
[2]  
Baillargeat C., Contribution a lAmelioration des Performances dun Radiotelephone Mains-Libres a Commande Vocale, Ph.D. thesis (in french), (1991)
[3]  
Mansour D., Juang B.H., The short-time modified coherence representation and noisy speech recognition, IEEE Trans. Acoust., 37, 6, pp. 795-804, (1989)
[4]  
Van Compernolle D., Ma W., Xie F., Van Diest M., Speech recognition in noisy environments with the aid of microphone arrays, Speech Comm., 9, 6, pp. 433-442, (1990)
[5]  
Lockwood P., Boudy J., Experiments with non-linear spectral subtractor (NSS), hidden Markov models, and the projection distance, for robust speech recognition in cars, Proc. Euro Speech91, 1, (1991)
[6]  
Van Summers W., Pisoni D.B., Bernacki R.H., Pedlow R.I., Stockes M.A., Effects of noise on speech production: Acoustic and perceptual analyses, J. Acoust. Soc. Amer., 84, 3, pp. 917-928, (1988)
[7]  
Stanton B.J., Jamieson L.H., Allen G.D., Robust recognition of loud and Lombard speech in the fighter cockpit environment, Proc. ICASSP 89, 1, (1989)
[8]  
Mokbel C., Reconnaissance de la Parole dans le Bruit: Bruitage/Debruitage, Ph.D. thesis (French), (1992)
[9]  
First data base recording, ARS Project, (1990)
[10]  
Lee Y.-T., Information-theoretic distortion measures for speech recognition, IEEE Trans. Signal Processing, 39, 2, pp. 330-335, (1991)