Spatial temporal graph convolutional networks for skeleton-based dynamic hand gesture recognition

被引:0
作者
Yong Li
Zihang He
Xiang Ye
Zuguo He
Kangrong Han
机构
[1] School of Science,
[2] Beijing University of Posts and Telecommunications,undefined
[3] School of Electronic Engineering,undefined
[4] Beijing University of Posts and Telecommunications,undefined
[5] Ye Peida School of Innovation and Enterprencurship,undefined
[6] Beijing University of Posts and Telecommunications,undefined
来源
EURASIP Journal on Image and Video Processing | / 2019卷
关键词
Graph convolutional network; Deep learning; Hand gesture recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Hand gesture recognition methods play an important role in human-computer interaction. Among these methods are skeleton-based recognition techniques that seem to be promising. In literature, several methods have been proposed to recognize hand gestures with skeletons. One problem with these methods is that they consider little the connectivity between the joints of a skeleton, constructing simple graphs for skeleton connectivity. Observing this, we built a new model of hand skeletons by adding three types of edges in the graph to finely describe the linkage action of joints. Then, an end-to-end deep neural network, hand gesture graph convolutional network, is presented in which the convolution is conducted only on linked skeleton joints. Since the training dataset is relatively small, this work proposes expanding the coordinate dimensionality so as to let models learn more semantic features. Furthermore, relative coordinates are employed to help hand gesture graph convolutional network learn the feature representation independent of the random starting positions of actions. The proposed method is validated on two challenging datasets, and the experimental results show that it outperforms the state-of-the-art methods. Furthermore, it is relatively lightweight in practice for hand skeleton-based gesture recognition.
引用
收藏
相关论文
共 23 条
[1]  
Wang G.(2018)Region ensemble network: towards good practices for deep 3d hand pose estimation J. Vis. Commun. Image Represent. 55 404-414
[2]  
Chen X.(2014)Moddrop: Adaptive multi-modal gesture recognition IEEE Trans. Pattern. Anal. Mach. Intell. 38 1692-1706
[3]  
Guo H.(2016)A novel hierarchical framework for human action recognition Pattern Recognit. 55 148-159
[4]  
Zhang C.(2018)Convolutional neural networks and long short-term memory for skeleton-based human activity and hand gesture recognition Pattern Recognit. 76 80-96
[5]  
Neverova N.(2018)A fast uyghur text detector for complex background images IEEE Trans. Multimed 20 1-1
[6]  
Wolf C.(undefined)undefined undefined undefined undefined-undefined
[7]  
Taylor G.(undefined)undefined undefined undefined undefined-undefined
[8]  
Nebout F.(undefined)undefined undefined undefined undefined-undefined
[9]  
Chen H.(undefined)undefined undefined undefined undefined-undefined
[10]  
Wang G.(undefined)undefined undefined undefined undefined-undefined