Optimal feature vector for speech recognition of unequally segmented spoken digits

被引:0
作者
Karam, JR [1 ]
Phillips, WJ [1 ]
Robertson, W [1 ]
机构
[1] Dalhousie Univ, DalTech, Dept Engn Mech, Halifax, NS B3J 2X4, Canada
来源
2000 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS 1 AND 2: NAVIGATING TO A NEW ERA | 2000年
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we describe a model obtained by applying the Discrete Wavelet Transform (DWT) to unequally segmented digits. Each signal is divided with a pre-determined segmentation into a maximum of five subwords. The purpose is speaker independent single digit recognition. The parameterization of the subwords is accomplished by measuring its energy contents after decomposing it with the DWT. This model uses one coefficient per subword and produces up to a 99% recognition rate. It is superior in its class due to the high reduction in the size of the feature vector and consequently in the speed of processing. Typically the reduction is 20:1 if compared with the traditional Melscale model. A successful attempt to classify vowels and accurately identify digits visibly using the proposed model is undertaken. A Radial Basis Function Artificial Neural Network (RBF-ANN) is employed for the recognition tasks and for the comparison of the proposed model with the Fourier one [2]. We use orthogonal wavelets from the Daubechies set. Also the performances of some biorthogonal wavelets are included.
引用
收藏
页码:327 / 330
页数:4
相关论文
共 11 条
[1]  
[Anonymous], 1993, FUNDAMENTAL SPEECH R
[2]  
Daubechies I., 1993, Ten Lectures of Wavelets, V28, P350
[3]  
DAUBECHIES I, 6 MDSP WORKSH SESS T, P98
[4]  
GILBERT S, 1996, WAVELETS FILTER BANK
[5]  
KARAM JR, 2000, IN PRESS CCECE MAY
[6]  
MISITI M, 1997, MATLAB WAVELET TOOL
[7]  
*NIST, 1991, SPEECH DISCS STUD QU
[8]  
PHILLIPS JW, 1995, IEEE WESCANEX P
[9]  
PICONE JW, 1993, IEEE, V81
[10]  
RIOUL O, 1991, IEEE SIGNAL PROC OCT