Bayesian channel equalisation and robust features for speech recognition

被引：3

作者：

Milner, BP

Vaseghi, SV

机构：

[1] UNIV E ANGLIA, NORWICH NR4 7TJ, NORFOLK, ENGLAND

[2] QUEENS UNIV BELFAST, SCH ELECT ENGN & COMP SCI, BELFAST BT9 5AH, ANTRIM, NORTH IRELAND

来源：

IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING | 1996年 / 143卷 / 04期

关键词：

Bayesian channel; speech recognition; microphone;

D O I：

10.1049/ip-vis:19960577

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The use of a speech recognition system with telephone channel environments, or different microphones, requires channel equalisation. In speech recognition, the speech model provides a bank of statistical information that can be used in the channel identification and equalisation process. The authors consider HMM-based channel equalisation, and present results demonstrating that substantial improvement can be obtained through the equalisation process. An alternative method, for speech recognition, is to use a feature set which is more robust to channel distortion. Channel distortions result in an amplutude tilt of the speech cepstrum, and therefore differential cepstral features provide a measure of immunity to channel distortions. In particular the cepstral-time feature matrix, in addition to providing a framework for representing speech dynamics, call be made robust to channel distortions. The authors present results demonstrating that a major advantage of cepstral-time matrices is their channel insensitive character.

引用

页码：223 / 231

页数：9

共 17 条

[1]

[Anonymous], 1996, Advanced Signal Processing and Digital Noise Reduction

[2]

Bellini S., 1988, ICASSP 88: 1988 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.88CH2561-9), P2236, DOI 10.1109/ICASSP.1988.197081

[3]

BELLINI S, 1986, IEEE GLOB TEL DEC, P1634

[4] ROBUST IDENTIFICATION OF A NON-MINIMUM PHASE SYSTEM - BLIND ADJUSTMENT OF A LINEAR EQUALIZER IN DATA COMMUNICATIONS [J].