Visual speech recognition using wavelet transform and moment based features

Cited by: 0
Authors
Yau, Wai C. [1 ]
Kumar, Dinesh K. [1 ]
Arjunan, Sridhar P. [1 ]
Kumar, Sanjay [1 ]
Affiliation
[1] RMIT Univ, Sch Elect & Comp Engn, GPO Box 2476V, Melbourne, Vic 3001, Australia
Source
ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: ROBOTICS AND AUTOMATION | 2006
Keywords
Visual Speech Recognition; Motion History Image; Discrete Stationary Wavelet Transform; Image Moments; Artificial Neural Network;
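DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract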
This paper presents a novel vision-based approach to identifying utterances consisting of consonants. A view-based method is adopted to represent the 3-D image sequence of the mouth movement in a 2-D space using a grayscale image called a motion history image (MHI). The MHI is produced by applying an accumulative image differencing technique to the image sequence, which implicitly captures the temporal information of the mouth movement. The proposed technique combines the Discrete Stationary Wavelet Transform (SWT) and image moments to classify the MHI. A level-1 2-D SWT decomposes the MHI into one approximation sub-image and three detail sub-images. The paper reports the classification accuracy of three different moment-based features, namely Zernike moments, geometric moments and Hu moments, computed from the approximation of the MHI. A supervised feedforward multilayer perceptron (MLP) artificial neural network (ANN) trained with the back-propagation learning algorithm is used to classify the moment-based features. The performance and image representation ability of the three moment features are compared. The preliminary results show that all three moment types can achieve a high recognition rate when classifying three consonants.
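The feature-extraction steps named in the abstract (MHI, level-1 SWT approximation, moment features) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The difference threshold, the choice of the Haar wavelet, the periodic boundary handling, the toy frames and every function name are assumptions made for illustration only, since the abstract specifies none of them. Only the first two Hu invariants are computed, and the Zernike moments and the MLP classifier are omitted.

```python
def motion_history_image(frames, threshold=10):
    """Accumulate thresholded frame differences; newer motion is brighter.

    `threshold` is an assumed value, not taken from the paper.
    """
    rows, cols, n = len(frames[0]), len(frames[0][0]), len(frames)
    mhi = [[0.0] * cols for _ in range(rows)]
    for t in range(1, n):
        for r in range(rows):
            for c in range(cols):
                if abs(frames[t][r][c] - frames[t - 1][r][c]) >= threshold:
                    mhi[r][c] = t / (n - 1)  # later motion overwrites earlier
    return mhi


def haar_swt_approximation(img):
    """Level-1 undecimated Haar low-pass along rows and columns.

    Assumes the Haar wavelet and periodic (wrap-around) edges.
    """
    rows, cols = len(img), len(img[0])
    return [[0.5 * (img[r][c] + img[r][(c + 1) % cols]
                    + img[(r + 1) % rows][c]
                    + img[(r + 1) % rows][(c + 1) % cols])
             for c in range(cols)] for r in range(rows)]


def raw_moment(img, p, q):
    """Geometric moment m_pq = sum over pixels of x^p * y^q * f(x, y)."""
    return sum((c ** p) * (r ** q) * v
               for r, row in enumerate(img) for c, v in enumerate(row))


def hu_first_two(img):
    """First two Hu invariants, built from normalized central moments."""
    m00 = raw_moment(img, 0, 0)
    xbar = raw_moment(img, 1, 0) / m00
    ybar = raw_moment(img, 0, 1) / m00

    def eta(p, q):
        mu = sum(((c - xbar) ** p) * ((r - ybar) ** q) * v
                 for r, row in enumerate(img) for c, v in enumerate(row))
        return mu / m00 ** (1 + (p + q) / 2)

    h1 = eta(2, 0) + eta(0, 2)
    h2 = (eta(2, 0) - eta(0, 2)) ** 2 + 4 * eta(1, 1) ** 2
    return h1, h2


def make_frame(col, size=4):
    """Toy frame: one bright pixel in row 1 (hypothetical test data)."""
    frame = [[0] * size for _ in range(size)]
    frame[1][col] = 255
    return frame


frames = [make_frame(c) for c in range(3)]  # bright pixel moving right
mhi = motion_history_image(frames)
approx = haar_swt_approximation(mhi)
print(hu_first_two(approx))
```

In this sketch the resulting feature tuple would be the input to a classifier. The Hu invariants are unchanged by translation of the image content, which is one reason moment features suit a mouth region whose position varies between recordings.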
Pages: 340-345
Number of pages: 6