Multi-Scale Feature-Based Spatiotemporal Pyramid Network for Hand Gesture Recognition

被引:0
作者
Cao, Zongjing [1 ]
Li, Yan [1 ]
Shin, Byeong-Seok [1 ]
机构
[1] Inha Univ, Dept Elect & Comp Engn, Incheon, South Korea
基金
新加坡国家研究基金会;
关键词
Deep Learning; Hand Gesture Recognition; Pyramid Network; Spatiotemporal Feature;
D O I
10.22967/HCIS.2022.12.046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effectively capturing the spatiotemporal features of hand gestures from sequence data is crucial for gesture recognition. Existing work has effectively obtained motion features from between neighboring frames through well-designed temporal modeling networks; however, less attention has been paid to the spatial information contained in each frame. These approaches ignore the implicit complementary advantages of multi-scale appearance representations, which are essential to gesture recognition. We propose a multi-scale, feature-based spatiotemporal pyramid network for hand gesture recognition. It has a top-down, lateral-connection architecture designed to fuse spatial and temporal features from multiple scales in each layer. The network first outputs a coarse feature in a feedforward pass and then refines this feature in the top-down pass using features from successive lower layers. Similar to skip connections, our approach uses features from each layer of the network, but does not attempt to output independent predictions in each layer. Furthermore, we introduce a spatiotemporal pyramid module formed by stacking multiple successive refinement modules to fuse the multi -scale spatial feature output from each layer. We evaluate the proposed model with two publicly available benchmark hand gesture datasets. The model achieved accuracies of 85.1% and 95.4% for depth modality in the NVGesture and EgoGesture datasets, respectively. The comparison results show that the proposed hand gesture recognition method outperforms existing state-of-the-art methods.
引用
收藏
页数:14
相关论文
共 50 条
[41]   Simple feature pyramid network for weakly supervised object localization using multi-scale information [J].
Koo, Bongyeong ;
Choi, Han-Soo ;
Kang, Myungjoo .
MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2021, 32 (04) :1185-1197
[42]   Siamese Network Tracker Based on Multi-Scale Feature Fusion [J].
Zhao, Jiaxu ;
Niu, Dapeng .
SYSTEMS, 2023, 11 (08)
[43]   A Feature Pyramid Optical Flow Estimation Method Based on Multi-scale Deformable Convolution [J].
Fan B.-B. ;
Ge L.-Y. ;
Zhang C.-X. ;
Li B. ;
Feng C. ;
Chen Z. .
Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (01) :197-209
[44]   SOLAR CELL DEFECT DETECTION NETWORK BASED ON MULTI-SCALE ASYMPTOTIC PYRAMID [J].
Zhu, Lei ;
Geng, Cuicui ;
Li, Botao ;
Pan, Yang ;
Zhang, Bo ;
Yao, Li'na .
Taiyangneng Xuebao/Acta Energiae Solaris Sinica, 2025, 46 (05) :267-274
[45]   Deep Loss Driven Multi-Scale Hashing Based on Pyramid Connected Network [J].
Gu, Lingchen ;
Liu, Ju ;
Liu, Xiaoxi ;
Sun, Jiande .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :939-954
[46]   An Improved Multi-Scale Feature Extraction Network for Rice Disease and Pest Recognition [J].
Lv, Pengtao ;
Xu, Heliang ;
Zhang, Yana ;
Zhang, Qinghui ;
Pan, Quan ;
Qin, Yao ;
Chen, Youyang ;
Cao, Dengke ;
Wang, Jingping ;
Zhang, Mengya ;
Chen, Cong .
INSECTS, 2024, 15 (11)
[47]   A Feature Pyramid Network Based Detection Approach for Multi-Scale Target Detection in Stationary Short-Range Radar [J].
Reitz, Philipp ;
Veihelmann, Tobias ;
Kunzle, Christian ;
Franchi, Norman ;
Weigel, Robert ;
Luebke, Maximilian .
2025 IEEE WIRELESS AND MICROWAVE TECHNOLOGY CONFERENCE, WAMICON, 2025,
[48]   Research on the Lip-Print Recognition Based on Multi-scale Feature [J].
Zhou, Hongcheng .
IOT AS A SERVICE, IOTAAS 2023, 2025, 585 :16-21
[49]   Wafer defect recognition method based on multi-scale feature fusion [J].
Chen, Yu ;
Zhao, Meng ;
Xu, Zhenyu ;
Li, Kaiyue ;
Ji, Jing .
FRONTIERS IN NEUROSCIENCE, 2023, 17
[50]   Hand gesture recognition based on HOG-LBP feature [J].
Zhang, Fan ;
Liu, Yue ;
Zou, Chunyu ;
Wang, Yongtian .
2018 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC): DISCOVERING NEW HORIZONS IN INSTRUMENTATION AND MEASUREMENT, 2018, :1974-1979