Bayesian channel equalisation and robust features for speech recognition

被引:3
作者
Milner, BP
Vaseghi, SV
机构
[1] UNIV E ANGLIA, NORWICH NR4 7TJ, NORFOLK, ENGLAND
[2] QUEENS UNIV BELFAST, SCH ELECT ENGN & COMP SCI, BELFAST BT9 5AH, ANTRIM, NORTH IRELAND
来源
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING | 1996年 / 143卷 / 04期
关键词
Bayesian channel; speech recognition; microphone;
D O I
10.1049/ip-vis:19960577
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The use of a speech recognition system with telephone channel environments, or different microphones, requires channel equalisation. In speech recognition, the speech model provides a bank of statistical information that can be used in the channel identification and equalisation process. The authors consider HMM-based channel equalisation, and present results demonstrating that substantial improvement can be obtained through the equalisation process. An alternative method, for speech recognition, is to use a feature set which is more robust to channel distortion. Channel distortions result in an amplutude tilt of the speech cepstrum, and therefore differential cepstral features provide a measure of immunity to channel distortions. In particular the cepstral-time feature matrix, in addition to providing a framework for representing speech dynamics, call be made robust to channel distortions. The authors present results demonstrating that a major advantage of cepstral-time matrices is their channel insensitive character.
引用
收藏
页码:223 / 231
页数:9
相关论文
共 17 条
[1]  
[Anonymous], 1996, Advanced Signal Processing and Digital Noise Reduction
[2]  
Bellini S., 1988, ICASSP 88: 1988 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.88CH2561-9), P2236, DOI 10.1109/ICASSP.1988.197081
[3]  
BELLINI S, 1986, IEEE GLOB TEL DEC, P1634
[4]   ROBUST IDENTIFICATION OF A NON-MINIMUM PHASE SYSTEM - BLIND ADJUSTMENT OF A LINEAR EQUALIZER IN DATA COMMUNICATIONS [J].
BENVENISTE, A ;
GOURSAT, M ;
RUGET, G .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1980, 25 (03) :385-399
[5]  
HANSON BA, 1993, 1993 P IEEE INT C AC, V2, P79
[6]  
HERMANSKY H, 1992, INT C SPOK LANG PROC, P85
[7]   TECHNIQUES FOR ADAPTIVE EQUALIZATION OF DIGITAL COMMUNICATION SYSTEMS [J].
LUCKY, RW .
BELL SYSTEM TECHNICAL JOURNAL, 1966, 45 (02) :255-+
[8]   TUTORIAL ON HIGHER-ORDER STATISTICS (SPECTRA) IN SIGNAL-PROCESSING AND SYSTEM-THEORY - THEORETICAL RESULTS AND SOME APPLICATIONS [J].
MENDEL, JM .
PROCEEDINGS OF THE IEEE, 1991, 79 (03) :278-305
[9]   COMPARISON OF SOME NOISE-COMPENSATION METHODS FOR SPEECH RECOGNITION IN ADVERSE ENVIRONMENTS [J].
MILNER, BP ;
VASEGHI, SV .
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (05) :280-288
[10]  
MOKBEL C, 1993, P 3 EUR C SPEECH COM, V2, P1247