A lightweight architecture for hand gesture recognition

被引:4
作者
Dang, Tuan Linh [1 ]
Pham, Trung Hieu [1 ]
Dang, Quang Minh [1 ]
Monet, Nicolas [2 ]
机构
[1] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, 01 Dai Co Viet Rd, Hanoi 100000, Vietnam
[2] NAVER CLOVA, Avatar, 6 Buljeong Ro, Seongnam Si, Gyeonggi Do, South Korea
关键词
Hand gesture recognition; Lightweight architecture; Segmentation; Classification;
D O I
10.1007/s11042-023-14550-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a lightweight architecture to recognize hand gestures that can be implemented in the resource-constrained device. There are two main components in our proposed architecture. The first component uses segmentation algorithms as preprocessing to remove noise and irrelevant parts from the input data, while the second component employs a classification algorithm to recognize hand gestures. Different lightweight segmentation and classification algorithms were also investigated and customized. Experimental results showed that the proposed lightweight architecture obtained high accuracy with various datasets even with noisy and complicated-background samples, especially with the combinations of DeepLabV3+ as the segmentation method and MobileNetV2 or EfficientNetB0 as the classification method. In addition, the inference speed of our lightweight system can achieve approximately 20 milliseconds with the fastest backbone even without using a high-end GPU.
引用
收藏
页码:28569 / 28587
页数:19
相关论文
共 40 条
[1]  
[Anonymous], 2019, MOBILENETV3 IMPLEMEN
[2]  
[Anonymous], 2020, DATABASE HAND GESTUR
[3]   Semantic scene segmentation in unstructured environment with modified DeepLabV3+ [J].
Baheti, Bhakti ;
Innani, Shubham ;
Gajre, Suhas ;
Talbar, Sanjay .
PATTERN RECOGNITION LETTERS, 2020, 138 :223-229
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]  
Chong Y., 2016, J SOFTW ENG APPL, V9, P103, DOI 10.4236/jsea.2016.94009
[6]   Segmentation of Brain Tumors Using DeepLabv3+ [J].
Choudhury, Ahana Roy ;
Vanguri, Rami ;
Jambawalikar, Sachin R. ;
Kumar, Piyush .
BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2018, PT II, 2019, 11384 :154-167
[7]   HGR-Net: a fusion network for hand gesture segmentation and recognition [J].
Dadashzadeh, Amirhossein ;
Targhi, Alireza Tavakoli ;
Tahmasbi, Maryam ;
Mirmehdi, Majid .
IET COMPUTER VISION, 2019, 13 (08) :700-707
[8]  
Dang T., 2022, 2022 IEEE C EVOLUTIO, P1
[9]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[10]  
Elleuch H, 2015, INT CONF INTELL SYST, P195, DOI 10.1109/ISDA.2015.7489224