Multi-Scale Feature-Based Spatiotemporal Pyramid Network for Hand Gesture Recognition

被引:0
|
作者
Cao, Zongjing [1 ]
Li, Yan [1 ]
Shin, Byeong-Seok [1 ]
机构
[1] Inha Univ, Dept Elect & Comp Engn, Incheon, South Korea
基金
新加坡国家研究基金会;
关键词
Deep Learning; Hand Gesture Recognition; Pyramid Network; Spatiotemporal Feature;
D O I
10.22967/HCIS.2022.12.046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effectively capturing the spatiotemporal features of hand gestures from sequence data is crucial for gesture recognition. Existing work has effectively obtained motion features from between neighboring frames through well-designed temporal modeling networks; however, less attention has been paid to the spatial information contained in each frame. These approaches ignore the implicit complementary advantages of multi-scale appearance representations, which are essential to gesture recognition. We propose a multi-scale, feature-based spatiotemporal pyramid network for hand gesture recognition. It has a top-down, lateral-connection architecture designed to fuse spatial and temporal features from multiple scales in each layer. The network first outputs a coarse feature in a feedforward pass and then refines this feature in the top-down pass using features from successive lower layers. Similar to skip connections, our approach uses features from each layer of the network, but does not attempt to output independent predictions in each layer. Furthermore, we introduce a spatiotemporal pyramid module formed by stacking multiple successive refinement modules to fuse the multi -scale spatial feature output from each layer. We evaluate the proposed model with two publicly available benchmark hand gesture datasets. The model achieved accuracies of 85.1% and 95.4% for depth modality in the NVGesture and EgoGesture datasets, respectively. The comparison results show that the proposed hand gesture recognition method outperforms existing state-of-the-art methods.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Multi-scale temporal feature-based dense convolutional network for action recognition
    Li, Xiaoqiang
    Xie, Miao
    Zhang, Yin
    Li, Jide
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (06)
  • [2] Real-time gesture recognition based on feature recalibration network with multi-scale information
    Cao, Zhengcai
    Xu, Xiaowen
    Hu, Biao
    Zhou, Meng
    Li, Qinglin
    NEUROCOMPUTING, 2019, 347 : 119 - 130
  • [3] A Convolutional Neural Network With Multi-scale Kernel and Feature Fusion for sEMG-based Gesture Recognition
    Han, Lijun
    Zou, Yongxiang
    Cheng, Long
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 774 - 779
  • [4] A new weighted multi-scale descriptor for hand gesture recognition
    Zhang, Beiwei
    Ding, Wen
    Ye, JiaSheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 43325 - 43347
  • [5] A new weighted multi-scale descriptor for hand gesture recognition
    Beiwei Zhang
    Wen Ding
    JiaSheng Ye
    Multimedia Tools and Applications, 2024, 83 : 43325 - 43347
  • [6] Hand gesture recognition using fast multi-scale analysis
    Fang, Yikai
    Cheng, Jian
    Wang, Kongqiao
    Lu, Hanqing
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON IMAGE AND GRAPHICS, 2007, : 694 - +
  • [7] FEATURE FUSING OF FEATURE PYRAMID NETWORK FOR MULTI-SCALE PEDESTRIAN DETECTION
    Tesema, Fiseha B.
    Lin, Junpeng
    Ou, Jie
    Wu, Hong
    Zhu, William
    2018 15TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2018, : 10 - 13
  • [8] Hand Gesture Recognition Based on Multi Feature Fusion
    Yang, Hongling
    Xuan, Shibin
    Mo, Yuanbin
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2018, PT II, 2018, 10942 : 389 - 398
  • [9] Facial Feature-Based Drowsiness Detection With Multi-Scale Convolutional Neural Network
    Vijaypriya, V.
    Uma, Mohan
    IEEE ACCESS, 2023, 11 : 63417 - 63429
  • [10] Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet
    Shao, Xiaotao
    Wang, Qing
    Yang, Wei
    Chen, Yun
    Xie, Yi
    Shen, Yan
    Wang, Zhongli
    SENSORS, 2021, 21 (05) : 1 - 15