Fusion of 2D CNN and 3D DenseNet for Dynamic Gesture Recognition

Cited by: 29

Authors:

Zhang, Erhu [1]

Xue, Botao [1]

Cao, Fangzhou [1]

Duan, Jinghong [2]

Lin, Guangfeng [1]

Lei, Yifei [3]

Affiliations:

[1] Xian Univ Technol, Dept Informat Sci, Xian 710048, Peoples R China

[2] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China

[3] Changan Univ, Sch Elect & Control Engn, Xian 710064, Peoples R China

Source:

ELECTRONICS | 2019 / Vol. 8 / Issue 12

Funding:

National Natural Science Foundation of China;

Keywords:

gesture recognition; motion representation; 2D CNN; 3D DenseNet; information fusion; FLOW;

DOI:

10.3390/electronics8121511

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Gesture recognition has been applied in many fields because it is a natural means of human-computer communication. However, recognizing dynamic gestures remains challenging because of complex disturbance information and motion information. In this paper, we propose an effective dynamic gesture recognition method that fuses the prediction results of a two-dimensional (2D) motion representation convolutional neural network (CNN) model and a three-dimensional (3D) dense convolutional network (DenseNet) model. First, to obtain a compact and discriminative gesture motion representation, the motion history image (MHI) and a pseudo-coloring technique are employed to integrate the spatiotemporal motion sequence into a single frame image, which is then fed into a 2D CNN model for gesture classification. Next, the proposed 3D DenseNet model is used to extract spatiotemporal features directly from red, green, blue (RGB) gesture videos. Finally, the prediction results of the proposed 2D and 3D deep models are blended to boost recognition performance. Experimental results on two public datasets demonstrate the effectiveness of the proposed method.
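
The two ideas named in the abstract, collapsing a gesture clip into a motion history image and blending the predictions of two models, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The MHI update follows the standard temporal-template rule (a moving pixel is set to tau, every other pixel decays by 1 toward 0). The frame-difference threshold `diff_threshold`, the default choice of `tau`, and the weighted average controlled by `weight_2d` are assumptions made for illustration, since the abstract only states that the predictions are "blended". Pseudo-coloring is omitted.

```python
import numpy as np


def motion_history_image(frames, diff_threshold=30, tau=None):
    """Collapse a grayscale clip of shape (T, H, W) into one MHI scaled to [0, 1].

    Motion is detected by thresholded frame differencing (an assumption;
    any binary motion mask could be substituted).
    """
    frames = np.asarray(frames, dtype=np.int16)  # signed, so differences do not wrap
    num_frames = frames.shape[0]
    if num_frames < 2:
        raise ValueError("need at least two frames to measure motion")
    if tau is None:
        tau = num_frames - 1  # hypothetical default: one step per frame transition

    mhi = np.zeros(frames.shape[1:], dtype=np.float32)
    for t in range(1, num_frames):
        moving = np.abs(frames[t] - frames[t - 1]) > diff_threshold
        # Moving pixels are reset to tau; all others decay by 1, floored at 0.
        mhi = np.where(moving, float(tau), np.maximum(mhi - 1.0, 0.0))
    return mhi / float(tau)


def fuse_predictions(probs_2d, probs_3d, weight_2d=0.5):
    """Weighted average of two class-probability vectors (assumed fusion rule)."""
    probs_2d = np.asarray(probs_2d, dtype=np.float64)
    probs_3d = np.asarray(probs_3d, dtype=np.float64)
    fused = weight_2d * probs_2d + (1.0 - weight_2d) * probs_3d
    return fused, int(np.argmax(fused))
```

In this sketch, more recent motion appears brighter in the MHI, so a single image encodes both where and when the hand moved. With equal weights, `fuse_predictions` reduces to a plain average of the two models' class scores.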


Pages: 15
