Multi-Scale Feature-Based Spatiotemporal Pyramid Network for Hand Gesture Recognition

被引:0
作者
Cao, Zongjing [1 ]
Li, Yan [1 ]
Shin, Byeong-Seok [1 ]
机构
[1] Inha Univ, Dept Elect & Comp Engn, Incheon, South Korea
基金
新加坡国家研究基金会;
关键词
Deep Learning; Hand Gesture Recognition; Pyramid Network; Spatiotemporal Feature;
D O I
10.22967/HCIS.2022.12.046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effectively capturing the spatiotemporal features of hand gestures from sequence data is crucial for gesture recognition. Existing work has effectively obtained motion features from between neighboring frames through well-designed temporal modeling networks; however, less attention has been paid to the spatial information contained in each frame. These approaches ignore the implicit complementary advantages of multi-scale appearance representations, which are essential to gesture recognition. We propose a multi-scale, feature-based spatiotemporal pyramid network for hand gesture recognition. It has a top-down, lateral-connection architecture designed to fuse spatial and temporal features from multiple scales in each layer. The network first outputs a coarse feature in a feedforward pass and then refines this feature in the top-down pass using features from successive lower layers. Similar to skip connections, our approach uses features from each layer of the network, but does not attempt to output independent predictions in each layer. Furthermore, we introduce a spatiotemporal pyramid module formed by stacking multiple successive refinement modules to fuse the multi -scale spatial feature output from each layer. We evaluate the proposed model with two publicly available benchmark hand gesture datasets. The model achieved accuracies of 85.1% and 95.4% for depth modality in the NVGesture and EgoGesture datasets, respectively. The comparison results show that the proposed hand gesture recognition method outperforms existing state-of-the-art methods.
引用
收藏
页数:14
相关论文
共 50 条
[31]   Study on Satellite Signal Recognition with Multi-scale Feature Attention Network [J].
Li, Yun ;
Yang, Songlin ;
Xing, Zhitong ;
Wu, Guangfu ;
Ma, Hao .
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2025, 47 (06) :1792-1802
[32]   REAL TIME HAND GESTURE RECOGNITION VIA FINGER-EMPHASIZED MULTI-SCALE DESCRIPTION [J].
Yang, Jianyu ;
Zhu, Chen ;
Yuan, Junsong .
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, :631-636
[33]   Lightweight silkworm recognition based on Multi-scale feature fusion [J].
Wen, Chunming ;
Wen, Jie ;
Li, Jianheng ;
Luo, Yunyun ;
Chen, Minbo ;
Xiao, Zhanpeng ;
Xu, Qing ;
Liang, Xiang ;
An, Hui .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 200
[34]   Animal species detection and classification framework based on modified multi-scale attention mechanism and feature pyramid network [J].
Ukwuoma, Chiagoziem C. ;
Qin, Zhiguang ;
Yussif, Sophyani B. ;
Happy, Monday N. ;
Nneji, Grace U. ;
Urama, Gilbert C. ;
Ukwuoma, Chibueze D. ;
Darkwa, Nimo B. ;
Agobah, Harriet .
SCIENTIFIC AFRICAN, 2022, 16
[35]   Dynamic Hand Gesture Recognition Using Effective Feature Extraction and Attention Based Deep Neural Network [J].
Miah, Abu Saleh Musa ;
Shin, Jungpil ;
Hasan, Md. Al Mehedi ;
Okuyama, Yuichi ;
Nobuyoshi, Asai .
2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, :241-247
[36]   Hand Gesture Recognition Using Deep Feature Fusion Network Based on Wearable Sensors [J].
Yuan, Guan ;
Liu, Xiao ;
Yan, Qiuyan ;
Qiao, Shaojie ;
Wang, Zhixiao ;
Yuan, Li .
IEEE SENSORS JOURNAL, 2021, 21 (01) :539-547
[37]   MSAPVT: a multi-scale attention pyramid vision transformer network for large-scale fruit recognition [J].
Rao, Yao ;
Li, Chaofeng ;
Xu, Feiran ;
Guo, Ya .
JOURNAL OF FOOD MEASUREMENT AND CHARACTERIZATION, 2024, 18 (11) :9233-9251
[38]   A Hand Gesture Recognition Method Based on Multi-Feature Fusion and Template Matching [J].
Liu Yun ;
Zhang Lifeng ;
Zhang Shujun .
2012 INTERNATIONAL WORKSHOP ON INFORMATION AND ELECTRONICS ENGINEERING, 2012, 29 :1678-1684
[39]   Lithography hotspot detection through multi-scale feature fusion utilizing feature pyramid network and dense block [J].
Xu, Hui ;
Yuan, Ye ;
Ma, Ruijun ;
Qi, Pan ;
Tang, Fuxin ;
Xiao, Xinzhong ;
Huang, Wenxin ;
Liang, Huaguo .
JOURNAL OF MICRO-NANOPATTERNING MATERIALS AND METROLOGY-JM3, 2024, 23 (01)
[40]   Simple feature pyramid network for weakly supervised object localization using multi-scale information [J].
Bongyeong Koo ;
Han-Soo Choi ;
Myungjoo Kang .
Multidimensional Systems and Signal Processing, 2021, 32 :1185-1197