Spatio-temporal feature-extraction techniques for isolated gesture recognition in Arabic Sign Language

被引:80
作者
Shanableh, Tamer [1 ]
Assaleh, Khaled
Al-Rousan, M.
机构
[1] Amer Univ Sharjah, Dept Comp Sci, UAE, Sharjah, U Arab Emirates
[2] Amer Univ Sharjah, Dept Elect Engn, UAE, Sharjah, U Arab Emirates
[3] Jordam Univ Sci & Technol, Dept Comp Engn, Irbid 22110, Jordan
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2007年 / 37卷 / 03期
关键词
feature extraction; motion analysis; pattern classification; visual languages;
D O I
10.1109/TSMCB.2006.889630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents various spatio-temporal feature-extraction techniques with applications to online and offline recognitions of isolated Arabic Sign Language gestures. The temporal features of a video-based gesture are extracted through forward, backward, and bidirectional predictions. The prediction errors are thresholded and accumulated into one image that represents the motion of the sequence. The motion representation is then followed by spatial-domain feature extractions. As such, the temporal dependencies are eliminated and the whole video sequence is represented by a few coefficients. The linear separability of the extracted features is assessed, and its suitability for both parametric and nonparametric classification techniques is elaborated upon. The proposed feature-extraction scheme was complemented by simple classification techniques, namely, K nearest neighbor (KNN) and Bayesian, i.e., likelihood ratio, classifiers. Experimental results showed classification performance ranging from 97% to 100% recognition rates. To validate our proposed technique, we have conducted a series of experiments using the classical way of classifying data with temporal dependencies, namely, hidden Markov models (HMMs). Experimental results revealed that the proposed feature-extraction scheme combined with simple KNN or Bayesian classification yields comparable results to the classical HMM-based scheme. Moreover, since the proposed scheme compresses the motion information of an image sequence into a single image, it allows for using simple classification techniques where the temporal dimension is eliminated. This is actually advantageous for both computational and storage requirements of the classifier.
引用
收藏
页码:641 / 650
页数:10
相关论文
共 29 条
[1]   Recognition of gestures in Arabic sign language using neuro-fuzzy systems [J].
Al-Jarrah, O ;
Halawani, A .
ARTIFICIAL INTELLIGENCE, 2001, 133 (1-2) :117-138
[2]  
Al-Rousan M., 2001, International Journal of Computers and Their Applications, V8, P80
[3]  
[Anonymous], PSYCHOL PERSPECTIVES
[4]  
[Anonymous], 2005, 1449610 ISOIEC
[5]  
[Anonymous], 138182 ISOIEC
[6]  
[Anonymous], 2002, INT C VIS INTERFACE
[7]   Recognition of Arabic sign language alphabet using polynomial classifiers [J].
Assaleh, K ;
Al-Rousan, M .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (13) :2136-2145
[8]  
Bauer B, 2002, INT C PATT RECOG, P434, DOI 10.1109/ICPR.2002.1048332
[9]  
Bauer B., 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), P440, DOI 10.1109/AFGR.2000.840672
[10]   Hand gesture recognition using a real-time tracking method and hidden Markov models [J].
Chen, FS ;
Fu, CM ;
Huang, CL .
IMAGE AND VISION COMPUTING, 2003, 21 (08) :745-758