Fusion of 2D CNN and 3D DenseNet for Dynamic Gesture Recognition

被引:29
作者
Zhang, Erhu [1 ]
Xue, Botao [1 ]
Cao, Fangzhou [1 ]
Duan, Jinghong [2 ]
Lin, Guangfeng [1 ]
Lei, Yifei [3 ]
机构
[1] Xian Univ Technol, Dept Informat Sci, Xian 710048, Peoples R China
[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[3] Changan Univ, Sch Elect & Control Engn, Xian 710064, Peoples R China
基金
中国国家自然科学基金;
关键词
gesture recognition; motion representation; 2D CNN; 3D DenseNet; information fusion; FLOW;
D O I
10.3390/electronics8121511
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gesture recognition has been applied in many fields as it is a natural human-computer communication method. However, recognition of dynamic gesture is still a challenging topic because of complex disturbance information and motion information. In this paper, we propose an effective dynamic gesture recognition method by fusing the prediction results of a two-dimensional (2D) motion representation convolution neural network (CNN) model and three-dimensional (3D) dense convolutional network (DenseNet) model. Firstly, to obtain a compact and discriminative gesture motion representation, the motion history image (MHI) and pseudo-coloring technique were employed to integrate the spatiotemporal motion sequences into a frame image, before being fed into a 2D CNN model for gesture classification. Next, the proposed 3D DenseNet model was used to extract spatiotemporal features directly from Red, Green, Blue (RGB) gesture videos. Finally, the prediction results of the proposed 2D and 3D deep models were blended together to boost recognition performance. The experimental results on two public datasets demonstrate the effectiveness of our proposed method.
引用
收藏
页数:15
相关论文
共 37 条
  • [1] Improving weapon detection in single energy X-ray images through pseudocoloring
    Abidi, Besma R.
    Zheng, Yue
    Gribok, Andrei V.
    Abidi, Mongi A.
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2006, 36 (06): : 784 - 796
  • [2] The recognition of human movement using temporal templates
    Bobick, AF
    Davis, JW
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (03) : 257 - 267
  • [3] DMMs-Based Multiple Features Fusion for Human Action Recognition
    Bulbul, Mohammad Farhad
    Jiang, Yunsheng
    Ma, Jinwen
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2015, 6 (04) : 23 - 39
  • [4] Chen C, 2015, IEEE IMAGE PROC, P168, DOI 10.1109/ICIP.2015.7350781
  • [5] Jointly network: a network based on CNN and RBM for gesture recognition
    Cheng, Wentao
    Sun, Ying
    Li, Gongfa
    Jiang, Guozhang
    Liu, Honghai
    [J]. NEURAL COMPUTING & APPLICATIONS, 2019, 31 (Suppl 1) : 309 - 323
  • [6] An Efficient Hand Gesture Recognition System Based on Deep CNN
    Chung, Hung-Yuan
    Chung, Yao-Liang
    Tsai, Wei-Feng
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2019, : 853 - 858
  • [7] Danafar S, 2007, LECT NOTES COMPUT SC, V4844, P457
  • [8] De Smedt Q., 2017, WORKSHOP 3D OBJECT R, P33
  • [9] Skeleton-based Dynamic hand gesture recognition
    De Smedt, Quentin
    Wannous, Hazem
    Vandeborre, Jean-Philippe
    [J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 1206 - 1214
  • [10] 3-D Human Action Recognition by Shape Analysis of Motion Trajectories on Riemannian Manifold
    Devanne, Maxime
    Wannous, Hazem
    Berretti, Stefano
    Pala, Pietro
    Daoudi, Mohamed
    Del Bimbo, Alberto
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (07) : 1340 - 1352