Convolutional neural network with spatial pyramid pooling for hand gesture recognition

被引:0
|
作者
Yong Soon Tan
Kian Ming Lim
Connie Tee
Chin Poo Lee
Cheng Yaw Low
机构
[1] Multimedia University,Faculty of Information Science and Technology (FIST)
来源
关键词
Convolutional neural network (CNN); Spatial pyramid pooling (SPP); Hand gesture recognition; Sign language recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Hand gesture provides a means for human to interact through a series of gestures. While hand gesture plays a significant role in human–computer interaction, it also breaks down the communication barrier and simplifies communication process between the general public and the hearing-impaired community. This paper outlines a convolutional neural network (CNN) integrated with spatial pyramid pooling (SPP), dubbed CNN–SPP, for vision-based hand gesture recognition. SPP is discerned mitigating the problem found in conventional pooling by having multi-level pooling stacked together to extend the features being fed into a fully connected layer. Provided with inputs of varying sizes, SPP also yields a fixed-length feature representation. Extensive experiments have been conducted to scrutinize the CNN–SPP performance on two well-known American sign language (ASL) datasets and one NUS hand gesture dataset. Our empirical results disclose that CNN–SPP prevails over other deep learning-driven instances.
引用
收藏
页码:5339 / 5351
页数:12
相关论文
共 50 条
  • [21] Spatial Pyramid Pooling of Selective Convolutional Features for Vein Recognition
    Wang, Jun
    Pan, Zaiyu
    Wang, Guoqing
    Li, Ming
    Li, Yulian
    IEEE ACCESS, 2018, 6 : 28563 - 28572
  • [22] An optimized convolutional neural network with bottleneck and spatial pyramid pooling layers for classification of foods
    Jahani Heravi, Elnaz
    Habibi Aghdam, Hamed
    Puig, Domenec
    PATTERN RECOGNITION LETTERS, 2018, 105 : 50 - 58
  • [23] Gesture recognition of graph convolutional neural network based on spatial domain
    Chen, Hong
    Zhao, Hongdong
    Qi, Baoqiang
    Zhang, Shuai
    Yu, Zhanghong
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (03): : 2157 - 2167
  • [24] Gesture recognition of graph convolutional neural network based on spatial domain
    Hong Chen
    Hongdong Zhao
    Baoqiang Qi
    Shuai Zhang
    Zhanghong Yu
    Neural Computing and Applications, 2023, 35 : 2157 - 2167
  • [25] Hand gesture recognition via enhanced densely connected convolutional neural network
    Tan, Yong Soon
    Lim, Kian Ming
    Lee, Chin Poo
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 175
  • [26] Automated Hand Gesture Recognition using a Deep Convolutional Neural Network model
    Dhall, Ishika
    Vashisth, Shubham
    Aggarwal, Garima
    PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 811 - 816
  • [27] Hand Gesture Recognition Using an Adapted Convolutional Neural Network with Data Augmentation
    Alani, Ali A.
    Cosma, Georgina
    Taherkhani, Aboozar
    McGinnity, T. M.
    2018 4TH INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT (ICIM2018), 2018, : 5 - 12
  • [28] Performance Evaluation of Convolutional Neural Network for Hand Gesture Recognition Using EMG
    Asif, Ali Raza
    Waris, Asim
    Gilani, Syed Omer
    Jamil, Mohsin
    Ashraf, Hassan
    Shafique, Muhammad
    Niazi, Imran Khan
    SENSORS, 2020, 20 (06)
  • [29] Static Hand Gesture Recognition using Convolutional Neural Network with Data Augmentation
    Islam, Md Zahirul
    Hossain, Mohammad Shahadat
    Ul Islam, Raihan
    Anderssor, Karl
    2019 JOINT 8TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2019 3RD INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR) WITH INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING (ABC), 2019, : 324 - 329
  • [30] A Convolutional Neural Network Model for Identifying Unclassified and Misclassified Vehicles Using Spatial Pyramid Pooling
    Bao, Jieyi
    Hu, Xiaoqiang
    Jiang, Yi
    Li, Shuo
    CONSTRUCTION RESEARCH CONGRESS 2022: COMPUTER APPLICATIONS, AUTOMATION, AND DATA ANALYTICS, 2022, : 956 - 966