STA-GCN: two-stream graph convolutional network with spatial-temporal attention for hand gesture recognition

Cited by: 36
Authors
Zhang, Wei [1 ,2 ]
Lin, Zeyi [1 ,2 ]
Cheng, Jian [1 ,2 ]
Ma, Cuixia [1 ,2 ]
Deng, Xiaoming [1 ,2 ]
Wang, Hongan [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Inst Software, Beijing Key Lab Human Comp Interact, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
Source
VISUAL COMPUTER | 2020 / Vol. 36 / Issue 10-12
Keywords
Hand gesture recognition; Graph convolutional network; Spatial-temporal attention;
DOI
10.1007/s00371-020-01955-w
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Skeleton-based hand gesture recognition is an active research topic in computer graphics and computer vision and has a wide range of applications in VR/AR and robotics. Although spatial-temporal graph convolutional networks have been used successfully for skeleton-based hand gesture recognition, existing methods often use a fixed spatial graph derived from the hand skeleton tree, or a fixed graph along the temporal dimension, which may not be optimal for hand gesture recognition. In this paper, we propose a two-stream graph attention convolutional network with spatial-temporal attention for hand gesture recognition. We adopt a pose stream and a motion stream as the two input streams of our network. In the pose stream, we use the joints in each frame as the input; in the motion stream, we use the joint offsets between neighboring frames as the input. We propose a new temporal graph attention module to model temporal dependency, and we also use a spatial graph attention module to construct a dynamic skeleton graph. For each stream, we adopt a graph convolutional network with spatial-temporal attention to extract features. We then concatenate the features of the pose stream and the motion stream for gesture recognition. We achieve competitive performance on the main hand gesture recognition benchmark datasets, which demonstrates the effectiveness of our method.
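As a rough illustration of the two input streams described in the abstract, the sketch below builds a pose stream (the joints in each frame) and a motion stream (joint offsets between neighboring frames) from a skeleton sequence, then concatenates two feature vectors. It is a minimal sketch, not the authors' implementation: the function names `build_streams` and `fuse_features`, the `(frames, joints, coordinates)` array layout, and the zero-filled first motion frame are all assumptions made for illustration, and the attention and graph convolution modules are not shown.

```python
import numpy as np


def build_streams(joints):
    """Return (pose, motion) inputs from a skeleton sequence.

    joints: array of shape (T, J, C), i.e. T frames, J joints,
    C coordinates per joint (layout is an assumption).
    """
    joints = np.asarray(joints, dtype=np.float64)
    # Pose stream: the joint coordinates of each frame, unchanged.
    pose = joints
    # Motion stream: offsets between neighboring frames.
    # Assumption: the first frame has no predecessor, so it is zero-filled
    # to keep both streams the same length.
    motion = np.zeros_like(joints)
    motion[1:] = joints[1:] - joints[:-1]
    return pose, motion


def fuse_features(pose_feat, motion_feat):
    """Concatenate per-stream feature vectors along the last axis."""
    return np.concatenate([pose_feat, motion_feat], axis=-1)


if __name__ == "__main__":
    # Hypothetical sequence: 8 frames, 22 joints, 3D coordinates.
    rng = np.random.default_rng(0)
    sequence = rng.normal(size=(8, 22, 3))
    pose, motion = build_streams(sequence)
    print(pose.shape, motion.shape)
```

In a full model each stream would pass through its own feature extractor before `fuse_features` is applied; here the function only shows the concatenation step named in the abstract.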
Pages: 2433-2444
Page count: 12
Related papers
50 in total
  • [1] STA-GCN: two-stream graph convolutional network with spatial–temporal attention for hand gesture recognition
    Wei Zhang
    Zeyi Lin
    Jian Cheng
    Cuixia Ma
    Xiaoming Deng
    Hongan Wang
    The Visual Computer, 2020, 36 : 2433 - 2444
  • [2] STA-GCN:Spatial Temporal Adaptive Graph Convolutional Network for Gait Emotion Recognition
    Chen, Chuang
    Sun, Xiao
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1385 - 1390
  • [3] STA-GCN: Spatial-Temporal Self-Attention Graph Convolutional Networks for Traffic-Flow Prediction
    Chang, Zhihong
    Liu, Chunsheng
    Jia, Jianmin
    APPLIED SCIENCES-BASEL, 2023, 13 (11):
  • [4] Skeleton-based emotion recognition based on two-stream self-attention enhanced spatial-temporal graph convolutional network
    Shi, Jiaqi
    Liu, Chaoran
    Ishi, Carlos Toshinori
    Ishiguro, Hiroshi
    Sensors (Switzerland), 2021, 21 (01): : 1 - 16
  • [5] Skeleton-Based Emotion Recognition Based on Two-Stream Self-Attention Enhanced Spatial-Temporal Graph Convolutional Network
    Shi, Jiaqi
    Liu, Chaoran
    Ishi, Carlos Toshinori
    Ishiguro, Hiroshi
    SENSORS, 2021, 21 (01) : 1 - 16
  • [6] Spatial-Temporal Graph Neural Network based Hand Gesture Recognition
    Yuan G.
    Bing R.
    Liu X.
    Dai W.
    Zhang Y.-M.
    Cai Z.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (04): : 921 - 931
  • [7] Two-Stream Spatial-Temporal Graph Convolutional Networks for Driver Drowsiness Detection
    Bai, Jing
    Yu, Wentao
    Xiao, Zhu
    Havyarimana, Vincent
    Regan, Amelia C.
    Jiang, Hongbo
    Jiao, Licheng
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13821 - 13833
  • [8] A Two-Stream Network For Driving Hand Gesture Recognition
    Zhou, Yefan
    Lv, Zhao
    Wang, Chaoqun
    Zhang, Shengli
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2020), 2020, : 553 - 560
  • [9] Spatial-temporal multiscale feature optimization based two-stream convolutional neural network for action recognition
    Xia, Limin
    Fu, Weiye
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08): : 11611 - 11626
  • [10] Two-stream Graph Attention Convolutional for Video Action Recognition
    Zhang, Deyuan
    Gao, Hongwei
    Dai, Hailong
    Shi, Xiangbin
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (BIGDATASE 2021), 2021, : 23 - 27