Efficient inception V2 based deep convolutional neural network for real-time hand action recognition

被引:36
作者
Bose, S. Rubin [1 ]
Kumar, V. Sathiesh [1 ]
机构
[1] Anna Univ, Dept Elect Engn, Madras Inst Technol Campus, Chennai, Tamil Nadu, India
关键词
convolution; neural nets; gesture recognition; image classification; feature extraction; learning (artificial intelligence); AP; Faster R-CNN Inception V2 model; real-time hand gesture recognition system; Efficient inception V2; real-time hand action recognition; effective network; accurate deep convolutional neural network; faster region-based convolutional neural network; standard data sets; NUS hand posture data set-II; IoU value; higher precision; SSD Inception V2 model; MITI-HD; 160; GESTURE RECOGNITION;
D O I
10.1049/iet-ipr.2019.0985
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most effective and accurate deep convolutional neural network (faster region-based convolutional neural network (Faster R-CNN) Inception V2 model, single shot detector (SSD) Inception V2 model) based architectures for real-time hand gesture recognition is proposed. The proposed models are tested on standard data sets (NUS hand posture data set-II, Senz-3D) and custom-developed (MITI hand data set (MITI-HD)) data set. The performance metrics are analysed for intersection over union (IoU) ranges between 0.5 and 0.95. IoU value of 0.5 resulted in higher precision compared to other IoU values considered (0.5:0.95, 0.75). It is observed that the Faster R-CNN Inception V2 model resulted in higher precision (0.990 for AP(all), IoU = 0.5) compared to SSD Inception V2 model (0.984 for (all)) for MITI-HD 160. The computation time of Faster R-CNN Inception V2 is higher compared to SSD Inception V2 model and also resulted in less number of mispredictions. Increasing the size of samples (MITI-HD 300) resulted in improvement of AP(all) = 0.991. Improvement in large (APlarge) and medium (APmedium) size detections are not significant when compared to small (APsmall) detections. It is concluded that the Faster R-CNN Inception V2 model is highly suitable for real-time hand gesture recognition system under unconstrained environments.
引用
收藏
页码:688 / 696
页数:9
相关论文
共 32 条
[1]   Hand Gesture Recognition Using Input Impedance Variation of Two Antennas with Transfer Learning [J].
Alnujaim, Ibrahim ;
Alali, Hashim ;
Khan, Faisal ;
Kim, Youngwook .
IEEE SENSORS JOURNAL, 2018, 18 (10) :4129-4135
[2]  
[Anonymous], 2017, The nus hand posture datasets i
[3]  
[Anonymous], ARXIV151202325CSCV
[4]  
[Anonymous], 2015, IEEE INT CONF AUTOMA
[5]  
[Anonymous], P CIRP C MAN SYST EL
[6]  
[Anonymous], 2017, Contemporary Engineering Sciences, DOI [10.12988/CES.2017.710154, DOI 10.12988/CES.2017.710154]
[7]  
[Anonymous], P INT C PATT REC ICP
[8]  
[Anonymous], IEEE C COMP VIS PATT
[9]  
[Anonymous], ARXIV190110323V2CSCV
[10]  
[Anonymous], 2017, arXiv:1611.10012