Normalize d e dge convolutional networks for skeleton-based hand gesture recognition

被引:24
作者
Guo, Fangtai [1 ]
He, Zaixing [1 ,2 ]
Zhang, Shuyou [1 ]
Zhao, Xinyue [1 ]
Fang, Jinhui [1 ]
Tan, Jianrong [2 ]
机构
[1] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, State Key Lab CAD & CG, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Skeleton-based hand gesture recognition; Edge-varying graph; Normalized edge convolution; Zig-zag sampling strategy;
D O I
10.1016/j.patcog.2021.108044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic hand skeletons consisting of discrete spatial-temporal finger joint clouds effectively convey the intentions of communicators. Previous graph convolutional networks (GCNs) relying on human handcrafted inductive biases have been quickly promoted for skeleton-based hand gesture recognition (SHGR). However, most existing graph constructions for GCN-based solutions are set manually, only considering the physical topology of the hand skeleton, and the fixed dependencies among hand joints may lead to suboptimal models. To enrich the local dependencies, we emphasize that hand skeletons can be seen from two views: explicit joint clouds and implicit skeleton topology. Starting from those two views of hand gestures, we attempt to introduce dynamics and diversities into the local neighborhood of the graph by dividing it into sets of physical neighbors, temporal neighbors and varying neighbors. Next, we systematically proceed with three innovations, including the novel edge-varying graph, normalized edge convolution operation, and zig-zag sampling strategy, to alleviate the challenges resulting from engineering practices. Finally, spatial-based GCNs called normalized edge convolutional networks are constructed for hand gesture recognition. Experiments on publicly available hand datasets show that our work is stable for performing state-of-the-art gesture recognition, and ablation experiments are also provided to validate each contribution. (c) 2021 Published by Elsevier Ltd.
引用
收藏
页数:15
相关论文
共 55 条
[1]   Exploiting Recurrent Neural Networks and Leap Motion Controller for the Recognition of Sign Language and Semaphoric Hand Gestures [J].
Avola, Danilo ;
Bernardi, Marco ;
Cinque, Luigi ;
Foresti, Gian Luca ;
Massaroni, Cristiano .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (01) :234-245
[2]  
Biswas K.K, 2011, 5 INT C AUT ROB APPL, P100, DOI DOI 10.1109/ICARA.2011.6144864
[3]   MFA-Net: Motion Feature Augmented Network for Dynamic Hand Gesture Recognition from Skeletal Data [J].
Chen, Xinghao ;
Wang, Guijin ;
Guo, Hengkai ;
Zhang, Cairong ;
Wang, Hang ;
Zhang, Li .
SENSORS, 2019, 19 (02)
[4]  
Chen Y., 2019, ARXIV191210730PHYSIC
[5]   Graph convolutional network with structure pooling and joint-wise channel attention for action recognition [J].
Chen, Yuxin ;
Ma, Gaoqun ;
Yuan, Chunfeng ;
Li, Bing ;
Zhang, Hui ;
Wang, Fangshi ;
Hu, Weiming .
PATTERN RECOGNITION, 2020, 103
[6]  
De Smedt Q, 2017, Multimedia Tools and Applications, P1
[7]   Heterogeneous hand gesture recognition using 3D dynamic skeletal data [J].
De Smedt, Quentin ;
Wannous, Hazem ;
Vandeborre, Jean-Philippe .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 181 :60-72
[8]   Skeleton-based Dynamic hand gesture recognition [J].
De Smedt, Quentin ;
Wannous, Hazem ;
Vandeborre, Jean-Philippe .
PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, :1206-1214
[9]   Deep Learning for Hand Gesture Recognition on Skeletal Data [J].
Devineau, Guillaume ;
Xi, Wang ;
Moutarde, Fabien ;
Yang, Jie .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :106-113
[10]  
Du Y, 2015, PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, P579, DOI 10.1109/ACPR.2015.7486569