Efficient inception V2 based deep convolutional neural network for real-time hand action recognition

被引:35
作者
Bose, S. Rubin [1 ]
Kumar, V. Sathiesh [1 ]
机构
[1] Anna Univ, Dept Elect Engn, Madras Inst Technol Campus, Chennai, Tamil Nadu, India
关键词
convolution; neural nets; gesture recognition; image classification; feature extraction; learning (artificial intelligence); AP; Faster R-CNN Inception V2 model; real-time hand gesture recognition system; Efficient inception V2; real-time hand action recognition; effective network; accurate deep convolutional neural network; faster region-based convolutional neural network; standard data sets; NUS hand posture data set-II; IoU value; higher precision; SSD Inception V2 model; MITI-HD; 160; GESTURE RECOGNITION;
D O I
10.1049/iet-ipr.2019.0985
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most effective and accurate deep convolutional neural network (faster region-based convolutional neural network (Faster R-CNN) Inception V2 model, single shot detector (SSD) Inception V2 model) based architectures for real-time hand gesture recognition is proposed. The proposed models are tested on standard data sets (NUS hand posture data set-II, Senz-3D) and custom-developed (MITI hand data set (MITI-HD)) data set. The performance metrics are analysed for intersection over union (IoU) ranges between 0.5 and 0.95. IoU value of 0.5 resulted in higher precision compared to other IoU values considered (0.5:0.95, 0.75). It is observed that the Faster R-CNN Inception V2 model resulted in higher precision (0.990 for AP(all), IoU = 0.5) compared to SSD Inception V2 model (0.984 for (all)) for MITI-HD 160. The computation time of Faster R-CNN Inception V2 is higher compared to SSD Inception V2 model and also resulted in less number of mispredictions. Increasing the size of samples (MITI-HD 300) resulted in improvement of AP(all) = 0.991. Improvement in large (APlarge) and medium (APmedium) size detections are not significant when compared to small (APsmall) detections. It is concluded that the Faster R-CNN Inception V2 model is highly suitable for real-time hand gesture recognition system under unconstrained environments.
引用
收藏
页码:688 / 696
页数:9
相关论文
共 50 条
  • [31] Real-Time Dorsal Hand Recognition Based on Smartphone
    Sayed, Mohamed, I
    Taha, Mohamed
    Zayed, Hala H.
    IEEE ACCESS, 2021, 9 : 151118 - 151128
  • [32] Real-Time Convolutional Neural Network-Based Speech Source Localization on Smartphone
    Kucuk, Abdullah
    Ganguly, Anshuman
    Hao, Yiya
    Panahi, Issa M. S.
    IEEE ACCESS, 2019, 7 : 169969 - 169978
  • [33] Real-Time Stress Assessment Using Sliding Window Based Convolutional Neural Network
    Naqvi, Syed Faraz
    Ali, Syed Saad Azhar
    Yahya, Norashikin
    Yasin, Mohd Azhar
    Hafeez, Yasir
    Subhani, Ahmad Rauf
    Adil, Syed Hasan
    Al Saggaf, Ubaid M.
    Moinuddin, Muhammad
    SENSORS, 2020, 20 (16) : 1 - 17
  • [34] A Real-Time Bridge Crack Detection Method Based on an Improved Inception-Resnet-v2 Structure
    Wang, Jinkang
    He, Xiaohui
    Faming, Shao
    Lu, Guanlin
    Cong, Hu
    Jiang, Qunyan
    IEEE ACCESS, 2021, 9 : 93209 - 93223
  • [35] An Efficient Feature Fusion of Graph Convolutional Networks and Its Application for Real-Time Traffic Control Gestures Recognition
    Dinh-Tan Pham
    Quang-Tien Pham
    Thi-Lan Le
    Hai Vu
    IEEE ACCESS, 2021, 9 : 121930 - 121943
  • [36] CSI-DeepNet: A Lightweight Deep Convolutional Neural Network Based Hand Gesture Recognition System Using Wi-Fi CSI Signal
    Kabir, M. Humayun
    Hasan, Md. Ali
    Shin, Wonjae
    IEEE ACCESS, 2022, 10 : 114787 - 114801
  • [37] A Sensorless Control System for an Implantable Heart Pump Using a Real-Time Deep Convolutional Neural Network
    Fetanat, Masoud
    Stevens, Michael
    Hayward, Christopher
    Lovell, Nigel H.
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2021, 68 (10) : 3029 - 3038
  • [38] Real-Time Hand Tracking and Gesture Recognition Using Semantic-Probabilistic Network
    Kovalenko, Mykyta
    Antoshchuk, Svetlana
    Sieck, Juergen
    2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 269 - 274
  • [39] Human Activity Recognition Based on Gramian Angular Field and Deep Convolutional Neural Network
    Xu, Hongji
    Li, Juan
    Yuan, Hui
    Liu, Qiang
    Fan, Shidi
    Li, Tiankuo
    Sun, Xiaojie
    IEEE ACCESS, 2020, 8 (08): : 199393 - 199405
  • [40] Real-Time Virtual Lego Brick Manipulation Based on Hand Gesture Recognition
    Tran Van Thanh
    Kim, Dongho
    Jeong, Young-Sik
    Advanced Multimedia and Ubiquitous Engineering: Future Information Technology, 2015, 352 : 231 - 238