Large-scale gesture recognition with a fusion of RGB-D data based on optical flow and the C3D model

被引:31
作者
Li, Yunan [1 ]
Miao, Qiguang [1 ]
Tian, Kuan [1 ]
Fan, Yingying [1 ]
Xu, Xin [1 ]
Ma, Zhenxin [1 ]
Song, Jianfeng [1 ]
机构
[1] Xidian Univ, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Gesture recognition; RGB-D data; Optical flow; 3D Convolutional Neural Networks;
D O I
10.1016/j.patrec.2017.12.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gesture recognition has attracted great attention owing to its applications in many fields such as Human Computer Interaction. However, in video-based gesture recognition, some gesture-irrelevant factors like the background handicap the improvement of recognition rate. In this paper, we propose an effective 3D Convolutional Neural Network based method for large-scale gesture recognition using RGB-D video data. To obtain compact but with sufficient motion path information data for the network, the inputs are unified into 32-frame videos first. Then the optical flow images are constructed from the RGB videos frame by frame, to help with eliminating the disturbing background inside them. After that, the spatiotemporal features of de-background RGB and depth data are extracted with the C3D model (a 3D CNN model) respectively and blended together in the next stage according to the discriminant correlation analysis to boost the performance. Finally the classes are predicted with a linear SVM classifier. Our proposed method achieves 54.50% accuracy on the validation subset and 60.93% on the testing subset of the Chalearn LAP IsoGD dataset, both of which outperform our results (ranked 1st place) in the Chalearn LAP Large-scale Gesture Recognition Challenge. (C) 2017 Published by Elsevier B.V.
引用
收藏
页码:187 / 194
页数:8
相关论文
共 38 条
  • [1] ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition
    Wan, Jun
    Lin, Chi
    Wen, Longyin
    Li, Yunan
    Miao, Qiguang
    Escalera, Sergio
    Anbarjafari, Gholamreza
    Guyon, Isabelle
    Guo, Guodong
    Li, Stan Z.
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (05) : 3422 - 3433
  • [2] A spatiotemporal attention-based ResC3D model for large-scale gesture recognition
    Li, Yunan
    Miao, Qiguang
    Qi, Xiangda
    Ma, Zhenxin
    Ouyang, Wanli
    MACHINE VISION AND APPLICATIONS, 2019, 30 (05) : 875 - 888
  • [3] Stable and real-time hand gesture recognition based on RGB-D data
    Liu, Bo
    Wang, Guijin
    Chen, Xinghao
    He, Bei
    2013 INTERNATIONAL CONFERENCE ON OPTICAL INSTRUMENTS AND TECHNOLOGY: OPTOELECTRONIC IMAGING AND PROCESSING TECHNOLOGY, 2013, 9045
  • [4] Sparse Representation Based Approach for RGB-D Hand Gesture Recognition
    Su, Te-Feng
    Fan, Chin-Yun
    Lin, Meng-Hsuan
    Lai, Shang-Hong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2015, PT II, 2015, 9315 : 564 - 570
  • [5] A spatiotemporal attention-based ResC3D model for large-scale gesture recognition
    Yunan Li
    Qiguang Miao
    Xiangda Qi
    Zhenxin Ma
    Wanli Ouyang
    Machine Vision and Applications, 2019, 30 : 875 - 888
  • [6] Selection of Large-Scale 3D Point Cloud Data Using Gesture Recognition
    Burgess, Robin
    Falcao, Antonio J.
    Fernandes, Tiago
    Ribeiro, Rita A.
    Gomes, Miguel
    Krone-Martins, Alberto
    de Almeida, Andre Moitinho
    TECHNOLOGICAL INNOVATION FOR CLOUD-BASED ENGINEERING SYSTEMS, 2015, 450 : 188 - 195
  • [7] Infrared and 3D Skeleton Feature Fusion for RGB-D Action Recognition
    De Boissiere, Alban Main
    Noumeir, Rita
    IEEE ACCESS, 2020, 8 (08): : 168297 - 168308
  • [8] Robust Hand Gesture Recognition Based on RGB-D Data for Natural Human-Computer Interaction
    Xu, Jun
    Wang, Hanchen
    Zhang, Jianrong
    Cai, Linqin
    IEEE ACCESS, 2022, 10 : 54549 - 54562
  • [9] SVM and RGB-D Sensor Based Gesture Recognition for UAV Control<bold> </bold>
    Aguilar, Wilbert G.
    Cobena, Bryan
    Rodriguez, Guillermo
    Salcedo, Vinicio S.
    Collaguazo, Brayan
    AUGMENTED REALITY, VIRTUAL REALITY, AND COMPUTER GRAPHICS, AVR 2018, PT II, 2018, 10851 : 713 - 719
  • [10] One-shot Learning Gesture Recognition based on Improved 3D SMoSIFT Feature Descriptor from RGB-D Videos
    Lin, Jia
    Ruan, Xiaogang
    Yu, Naigong
    Wei, Ruoyan
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4947 - 4952