Locally regularized sliced inverse regression based 3D hand gesture recognition on a dance robot

被引:14
作者
Cheng, Jun [1 ,2 ,3 ]
Bian, Wei [4 ]
Tao, Dacheng [4 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[2] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[3] Guangdong Prov Key Lab Robot & Intelligent Syst, Shenzhen, Peoples R China
[4] Univ Technol, Fac Engn & Informat Technol, Ctr Quantum Computat & Intelligent Syst, Ultimo, NSW 2007, Australia
关键词
Hand gesture recognition; Sliced inverse regression; Human-machine interaction (HMI); Multimedia entertainment;
D O I
10.1016/j.ins.2012.09.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gesture recognition plays an important role in human machine interactions (HMIs) for multimedia entertainment. In this paper, we present a dimension reduction based approach for dynamic real-time hand gesture recognition. The hand gestures are recorded as acceleration signals by using a handheld with a 3-axis accelerometer sensor installed, and represented by discrete cosine transform (DCT) coefficients. To recognize different hand gestures, we develop a new dimension reduction method, locally regularized sliced inverse regression (LR-SIR), to find an effective low dimensional subspace, in which different hand gestures are well separable, following which recognition can be performed by using simple and efficient classifiers, e.g., nearest mean, k-nearest-neighbor rule and support vector machine. LR-SIR is built upon the well-known sliced inverse regression (SIR), but overcomes its limitation that it ignores the local geometry of the data distribution. Besides, LR-SIR can be effectively and efficiently solved by eigen-decomposition. Finally, we apply the LR-SIR based gesture recognition to control our recently developed dance robot for multimedia entertainment. Thorough empirical studies on 'digits'-gesture recognition suggest the effectiveness of the new gesture recognition scheme for HMI. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:274 / 283
页数:10
相关论文
共 37 条
[1]   A Unified Framework for Gesture Recognition and Spatiotemporal Gesture Segmentation [J].
Alon, Jonathan ;
Athitsos, Vassilis ;
Yuan, Quan ;
Sclaroff, Stan .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (09) :1685-1699
[2]  
[Anonymous], CORR
[3]  
[Anonymous], 2009, ADV NEURAL INF PROCE
[4]   A new gesture recognition algorithm and segmentation method of Korean scripts for gesture-allowed ink editor [J].
Cho, MG .
INFORMATION SCIENCES, 2006, 176 (09) :1290-1303
[5]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[6]   DigitalBeing -: using the environment as an expressive medium for dance [J].
El-Nasr, Magy Self ;
Vasilakos, Athanasios V. .
INFORMATION SCIENCES, 2008, 178 (03) :663-678
[7]  
Gao Y, 2012, IEEE ANTENNAS PROP
[8]   Ensemble Manifold Regularization [J].
Geng, Bo ;
Tao, Dacheng ;
Xu, Chao ;
Yang, Linjun ;
Hua, Xian-Sheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (06) :1227-1233
[9]   Parallel Lasso for Large-Scale Video Concept Detection [J].
Geng, Bo ;
Li, Yangxi ;
Tao, Dacheng ;
Wang, Meng ;
Zha, Zheng-Jun ;
Xu, Chao .
IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (01) :55-65
[10]   DAML: Domain Adaptation Metric Learning [J].
Geng, Bo ;
Tao, Dacheng ;
Xu, Chao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (10) :2980-2989