Human gesture recognition of dynamic skeleton using graph convolutional networks

Cited by: 1
Authors
Liang, Wuyan [1]
Xu, Xiaolong [2]
Xiao, Fu [2]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
intelligent vision computing; graph convolutional networks; spatiotemporal correlations; dynamic gesture recognition; sign language recognition
DOI
10.1117/1.JEI.32.2.021402
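Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification codes
0808; 0809
Abstract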
Intelligent vision computing has long been a fascinating field. With the rapid development of computer vision, dynamic gesture-based recognition systems have attracted significant attention. However, automatically recognizing skeleton-based human gestures in the form of sign language is complex and challenging. Most existing methods treat skeleton-based human gesture recognition as a standard video recognition problem and ignore the rich structural information among both joints and gesture frames. Graph convolutional networks (GCNs) are a promising way to leverage this structural information and learn structural representations. However, applying GCNs to such gesture sequences in both the spatial and temporal domains is challenging because the graphs can be highly nonlinear and complex. To overcome this issue, we propose a spatiotemporal GCN model, called Aegles, that leverages spatiotemporal correlations to adaptively construct spatiotemporal graphs. Our method dynamically attends to the relatively significant spatiotemporal joints and constructs different graphs, including spatial, temporal, and spatiotemporal graphs, thereby capturing the structural information in gesture sequences. In addition, we introduce second-order information from the gesture skeleton data, i.e., the length and orientation of bones, to improve the representation of human hands and fingers. Furthermore, starting from public sign language datasets, we use OpenPose to extract human gesture skeletons and obtain skeleton videos, building four skeleton-based sign language recognition datasets. Experimental results show that Aegles outperforms state-of-the-art methods and that the spatiotemporal correlations effectively boost the performance of human gesture recognition.
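The second-order information mentioned in the abstract (bone length and orientation) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `bone_features`, the 2D keypoint layout, and the `bones` index pairs are assumptions chosen for illustration, and the record does not say how Aegles encodes orientation. Here orientation is assumed to be a unit direction vector from the source joint to the target joint.

```python
import math

def bone_features(joints, bones):
    """Return (length, unit_direction) for each bone in one frame.

    joints: list of (x, y) keypoint coordinates (assumed 2D, e.g. from a pose estimator).
    bones:  list of (source, target) joint-index pairs (hypothetical skeleton layout).
    """
    features = []
    for src, dst in bones:
        dx = joints[dst][0] - joints[src][0]
        dy = joints[dst][1] - joints[src][1]
        length = math.hypot(dx, dy)
        # Guard against zero-length bones, e.g. two keypoints detected at the same spot.
        direction = (dx / length, dy / length) if length > 0 else (0.0, 0.0)
        features.append((length, direction))
    return features

# Toy frame: a three-joint chain standing in for one finger.
joints = [(0.0, 0.0), (3.0, 4.0), (3.0, 6.0)]
bones = [(0, 1), (1, 2)]
features = bone_features(joints, bones)
```

In a skeleton-based pipeline, features like these would typically be computed per frame and fed to the network alongside the raw joint coordinates; whether Aegles combines them this way is not stated in the abstract.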
Pages: 21