An Integrative Framework of Human Hand Gesture Segmentation for Human-Robot Interaction

被引:41
作者
Ju, Zhaojie [1 ]
Ji, Xiaofei [2 ]
Li, Jing [3 ,4 ]
Liu, Honghai [1 ]
机构
[1] Univ Portsmouth, Sch Comp, Portsmouth PO1 2UP, Hants, England
[2] Shenyang Aerosp Univ, Sch Automat, Shenyang 110136, Liaoning, Peoples R China
[3] Nanchang Univ, Sch Informat Engn, Nanchang 330047, Jiangxi, Peoples R China
[4] Nanchang Univ, Jiangxi Prov Key Lab Intelligent Informat Syst, Nanchang 330047, Jiangxi, Peoples R China
来源
IEEE SYSTEMS JOURNAL | 2017年 / 11卷 / 03期
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Alignment; hand gesture segmentation; human-computer interaction (HCI); RGB-depth (RGB-D); CAMERA CALIBRATION; KINECT SENSOR; RECOGNITION; DEPTH;
D O I
10.1109/JSYST.2015.2468231
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a novel framework to segment hand gestures in RGB-depth (RGB-D) images captured by Kinect using humanlike approaches for human-robot interaction. The goal is to reduce the error of Kinect sensing and, consequently, to improve the precision of hand gesture segmentation for robot NAO. The proposed framework consists of two main novel approaches. First, the depth map and RGB image are aligned by using the genetic algorithm to estimate key points, and the alignment is robust to uncertainties of the extracted point numbers. Then, a novel approach is proposed to refine the edge of the tracked hand gestures in RGB images by applying a modified expectation-maximization (EM) algorithm based on Bayesian networks. The experimental results demonstrate that the proposed alignment method is capable of precisely matching the depth maps with RGB images, and the EM algorithm further effectively adjusts the RGB edges of the segmented hand gestures. The proposed framework has been integrated and validated in a system of human-robot interaction to improve NAO robot's performance of understanding and interpretation.
引用
收藏
页码:1326 / 1336
页数:11
相关论文
共 45 条
[1]  
[Anonymous], 2011, 2011 IEEE APPL IM PA
[2]  
[Anonymous], PROC IEEE INT SYMP
[3]  
Berger K., 2011, VMV, P317, DOI DOI 10.2312/PE/VMV/VMV11/317-324
[4]  
Bilmes J. A., 1998, A Gentle Tutorial of the EM Algorithm and Its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models, V4, P126
[5]  
Bin Liang, 2013, Lecture Notes on Software Engineering, V1, P339, DOI 10.7763/LNSE.2013.V1.73
[6]  
Bo L., 2013, EXPT ROBOTICS, P387, DOI DOI 10.1007/978-3-319-00065-7
[7]  
Cruz L., 2012, 2012 XXV SIBGRAPI Conference on Graphics, Patterns and Images Tutorials (SIBGRAPI-T), P36, DOI 10.1109/SIBGRAPI-T.2012.13
[8]  
Dollar P., 2005, Proceedings. 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS) (IEEE Cat. No. 05EX1178), P65
[9]   Combining multiple depth-based descriptors for hand gesture recognition [J].
Dominio, Fabio ;
Donadeo, Mauro ;
Zanuttigh, Pietro .
PATTERN RECOGNITION LETTERS, 2014, 50 :101-111
[10]  
Farid H., 1997, Computer Analysis of Images and Patterns. 7th International Conference, CAIP '97. Proceedings, P207, DOI 10.1007/3-540-63460-6_119