Spatial and Rotation Invariant 3D Gesture Recognition Based on Sparse Representation

Cited by: 0
Authors
Argelaguet, Ferran [1]
Ducoffe, Melanie [2]
Lecuyer, Anatole [1]
Gribonval, Remi [1]
Affiliations
[1] Inria, IRISA, Rocquencourt, France
[2] ENS Rennes, Rennes, France
Source
2017 IEEE SYMPOSIUM ON 3D USER INTERFACES (3DUI) | 2017
Keywords
I.5.2 [Pattern Recognition]: Design Methodology; Classifier design and evaluation; I.6.3 [Computing Methodologies]: Methodologies and Techniques; Interaction Techniques; DESIGN;
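DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract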
Advances in motion tracking technology, especially for commodity hardware, still require robust 3D gesture recognition in order to fully exploit the benefits of natural user interfaces. In this paper, we introduce a novel 3D gesture recognition algorithm based on the sparse representation of 3D human motion. The sparse representation of human motion provides a set of features that can be used to efficiently classify gestures in real time. Compared to existing gesture recognition systems, the proposed approach enables full spatial and rotation invariance and provides high tolerance to noise. Moreover, the proposed classification scheme takes into account the inter-user variability, which increases gesture classification accuracy in user-independent scenarios. We validated our approach with existing motion databases for gestural interaction and performed a user evaluation with naive subjects to show its robustness to arbitrarily defined gestures. The results showed that our classification scheme has high classification accuracy for user-independent scenarios even with users who have different handedness. We believe that sparse representation of human motion will pave the way for a new generation of 3D gesture recognition systems in order to fully open the potential of natural user interfaces.
Pages: 158-167
Page count: 10
Related papers
41 in total
[1]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[2]  
[Anonymous], P 14 ACM INT C MULT
[3]  
[Anonymous], 2013, ISRN ARTIFICIAL INTE, DOI 10.1155/2013/514641
[4]   Multimodal fusion for multimedia analysis: a survey [J].
Atrey, Pradeep K. ;
Hossain, M. Anwar ;
El Saddik, Abdulmotaleb ;
Kankanhalli, Mohan S. .
MULTIMEDIA SYSTEMS, 2010, 16 (06) :345-379
[5]   Decomposition and dictionary learning for 3D trajectories [J].
Barthelemy, Q. ;
Larue, A. ;
Mars, J. I. .
SIGNAL PROCESSING, 2014, 98 :423-437
[6]   3D Gesture classification with linear acceleration and angular velocity sensing devices for video games [J].
Cheema, Salman ;
Hoffman, Michael ;
LaViola, Joseph J., Jr. .
ENTERTAINMENT COMPUTING, 2013, 4 (01) :11-24
[7]   Feature Processing and Modeling for 6D Motion Gesture Recognition [J].
Chen, Mingyu ;
AlRegib, Ghassan ;
Juang, Biing-Hwang .
IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (03) :561-571
[8]  
Chen, Mingyu, 2012, Proceedings of the 3rd Multimedia Systems Conference, P83, DOI 10.1145/2155555.2155569
[9]   ORTHOGONAL LEAST-SQUARES METHODS AND THEIR APPLICATION TO NON-LINEAR SYSTEM-IDENTIFICATION [J].
CHEN, S ;
BILLINGS, SA ;
LUO, W .
INTERNATIONAL JOURNAL OF CONTROL, 1989, 50 (05) :1873-1896
[10]   Gesture recognition using a depth camera for human robot collaboration on assembly line [J].
Coupete, Eva ;
Moutarde, Fabien ;
Manitsaris, Sotiris .
6TH INTERNATIONAL CONFERENCE ON APPLIED HUMAN FACTORS AND ERGONOMICS (AHFE 2015) AND THE AFFILIATED CONFERENCES, AHFE 2015, 2015, 3 :518-525