Gesture recognition of graph convolutional neural network based on spatial domain

Cited by: 4
Authors
Chen, Hong [1, 2]
Zhao, Hongdong [1]
Qi, Baoqiang [3]
Zhang, Shuai [2]
Yu, Zhanghong [2]
Affiliations
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300130, Peoples R China
[2] Hebei Normal Univ Sci & Technol, Sch Math & Informat Technol, Qinhuangdao 066004, Hebei, Peoples R China
[3] Qinhuangdao Inst Technol, Dept Informat Engn, Qinhuangdao 066100, Hebei, Peoples R China
Keywords
Graph convolutional neural network; Gesture recognition; Graph-SAGE model; Cascade classifier
DOI
10.1007/s00521-022-07040-8
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
As computing and Internet technologies spread through human-computer interaction systems, interaction methods have changed substantially, and gesture recognition has become an active research topic with broad application prospects. A color segmentation experiment shows that the skin color of gestures clusters better in the YCrCg space than in the YCrCb space. For gesture image preprocessing, an improved Otsu method is proposed to segment the human hand by thresholding with better real-time performance; morphological processing is then applied, and a median filter removes noise to improve image quality. The recognition algorithm has two stages. First, Graph-SAGE recognizes the graph-structured data of the gesture. Then the AdaBoost algorithm combines two strong classifiers, a random forest and a support vector machine, into a cascade classifier, which classifies the output of Graph-SAGE to determine the meaning of the gesture. On the test set, the algorithm achieves an average detection accuracy of 91.70% and a recall of 94.23%, with an average detection time of 330 ms per frame.
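The preprocessing steps named in the abstract (threshold segmentation followed by median filtering) can be illustrated with a minimal NumPy sketch. This is the standard Otsu method, not the improved variant the paper proposes, whose details are not given here. The function names `otsu_threshold` and `median_filter3`, the 3x3 window size, and the synthetic test image are all assumptions made for illustration.

```python
import numpy as np

def otsu_threshold(gray):
    """Standard Otsu: pick the gray level that maximizes between-class variance.
    (Assumption: this is the textbook method, not the paper's improved version.)"""
    hist = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    prob = hist / hist.sum()
    omega = np.cumsum(prob)                  # probability of class "<= t"
    mu = np.cumsum(prob * np.arange(256))    # cumulative first moment up to t
    mu_total = mu[-1]
    denom = omega * (1.0 - omega)
    sigma_b = np.zeros(256)
    valid = denom > 0                        # skip levels where one class is empty
    sigma_b[valid] = (mu_total * omega[valid] - mu[valid]) ** 2 / denom[valid]
    return int(np.argmax(sigma_b))

def median_filter3(img):
    """3x3 median filter with replicated edges (window size is an assumption)."""
    padded = np.pad(img, 1, mode="edge")
    h, w = img.shape
    windows = np.stack([padded[dy:dy + h, dx:dx + w]
                        for dy in range(3) for dx in range(3)])
    return np.median(windows, axis=0).astype(img.dtype)

# Hypothetical 8x8 grayscale image: dark background on the left,
# bright "hand" region on the right, plus one salt-noise pixel.
img = np.full((8, 8), 40, dtype=np.uint8)
img[:, 4:] = 200
img[2, 1] = 255

filtered = median_filter3(img)   # denoise
t = otsu_threshold(filtered)     # choose the threshold automatically
mask = filtered > t              # True where the bright region is
```

Here the filter is applied before thresholding so the isolated noise pixel does not end up in the mask; the abstract lists the steps in a different order (segmentation, morphological processing, then median filtering), and either order can be tried with these two functions.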
Pages: 2157-2167
Page count: 11