Single Shot Detector CNN and Deep Dilated Masks for Vision-Based Hand Gesture Recognition From Video Sequences

被引:3
作者
Al Farid, Fahmid [1 ]
Hashim, Noramiza [2 ]
Bin Abdullah, Junaidi [2 ]
Bhuiyan, Md. Roman [2 ]
Kairanbay, Magzhan [3 ]
Yusoff, Zulfadzli [1 ]
Karim, Hezerul Abdul [1 ]
Mansor, Sarina [1 ]
Sarker, MD. Tanjil [1 ]
Ramasamy, Gobbi [1 ]
机构
[1] Multimedia Univ, Fac Engn, Cyberjaya 63100, Malaysia
[2] Multimedia Univ, Fac Comp & Informat, Cyberjaya 63100, Malaysia
[3] Suleyman Demirel Univ SDU, Fac Engn & Nat Sci, Alma Ata 32260, Kazakhstan
关键词
Gesture recognition; Support vector machines; Human computer interaction; Streaming media; Multimedia computing; Convolutional neural networks; Deep learning; video sequences; SVM; SSD-CNN; deep dilated mask;
D O I
10.1109/ACCESS.2024.3360857
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With an increasing number of people on the planet today, innovative human-computer interaction technologies and approaches may be employed to assist individuals in leading more fulfilling lives. Gesture-based technology has the potential to improve the safety and well-being of impaired people, as well as the general population. Recognizing gestures from video streams is a difficult problem because of the large degree of variation in the characteristics of each motion across individuals. In this article, we propose applying deep learning methods to recognize automated hand gestures using RGB and depth data. To train neural networks to detect hand gestures, any of these forms of data may be utilized. Gesture-based interfaces are more natural, intuitive, and straightforward. Earlier study attempted to characterize hand motions in a number of contexts. Our technique is evaluated using a vision-based gesture recognition system. In our suggested technique, image collection starts with RGB video and depth information captured with the Kinect sensor and is followed by tracking the hand using a single shot detector Convolutional Neural Network (SSD-CNN). When the kernel is applied, it creates an output value at each of the m $\times $ n locations. Using a collection of convolutional filters, each new feature layer generates a defined set of gesture detection predictions. After that, we perform deep dilation to make the gesture in the image masks more visible. Finally, hand gestures have been detected using the well-known classification technique SVM. Using deep learning we recognize hand gestures with higher accuracy of 93.68% in RGB passage, 83.45% in the depth passage, and 90.61% in RGB-D conjunction on the SKIG dataset compared to the state-of-the-art. In the context of our own created Different Camera Orientation Gesture (DCOG) dataset we got higher accuracy of 92.78% in RGB passage, 79.55% in the depth passage, and 88.56% in RGB-D conjunction for the gestures collected in 0-degree angle. Moreover, the framework intends to use unique methodologies to construct a superior vision-based hand gesture recognition system.
引用
收藏
页码:28564 / 28574
页数:11
相关论文
共 30 条
  • [1] Deep learning in vision-based static hand gesture recognition
    Oyedotun, Oyebade K.
    Khashman, Adnan
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 (12) : 3941 - 3951
  • [2] Vision-based Hand Gesture Recognition from RGB Video Data Using SVM
    Al Farid, Fahmid
    Hashim, Noramiza
    Abdullah, Junaidi
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT) 2019, 2019, 11049
  • [3] Deep learning in vision-based static hand gesture recognition
    Oyebade K. Oyedotun
    Adnan Khashman
    Neural Computing and Applications, 2017, 28 : 3941 - 3951
  • [4] Vision-based hand gesture recognition using deep learning for the interpretation of sign language
    Sharma, Sakshi
    Singh, Sukhwinder
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 182 (182)
  • [5] Recent methods in vision-based hand gesture recognition
    Badi H.
    Badi, Haitham (haitham@siswa.um.edu.my), 1600, Springer Science and Business Media Deutschland GmbH (01): : 77 - 87
  • [6] Survey on vision-based dynamic hand gesture recognition
    Tripathi, Reena
    Verma, Bindu
    VISUAL COMPUTER, 2024, 40 (09) : 6171 - 6199
  • [7] A Structured and Methodological Review on Vision-Based Hand Gesture Recognition System
    Al Farid, Fahmid
    Hashim, Noramiza
    Abdullah, Junaidi
    Bhuiyan, Md Roman
    Isa, Wan Noor Shahida Mohd
    Uddin, Jia
    Haque, Mohammad Ahsanul
    Husen, Mohd Nizam
    JOURNAL OF IMAGING, 2022, 8 (06)
  • [8] Recent methods and databases in vision-based hand gesture recognition: A review
    Pisharady, Pramod Kumar
    Saerbeck, Martin
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 141 : 152 - 165
  • [9] Vision-based Hand Tracking and Gesture Recognition for Augmented Assembly System
    Wu, Y. M.
    He, H. W.
    Sun, J.
    Ru, T.
    Zheng, D. T.
    MANUFACTURING AUTOMATION TECHNOLOGY, 2009, 392-394 : 1030 - 1036
  • [10] Literature review of vision-based dynamic gesture recognition using deep learning techniques
    Jain, Rahul
    Karsh, Ram Kumar
    Barbhuiya, Abul Abbas
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (22)