Gesture Recognition Based on CNN and DCGAN for Calculation and Text Output

被引：42

作者：

Fang, Wei ^{[1
,2
]}

Ding, Yewen ^{[1
]}

Zhang, Feihong ^{[1
]}

Sheng, Jack ^{[3
]}

机构：

[1] Nanjing Univ Informat Sci & Technol, Jiangsu Engn Ctr Network Monitoring, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China

[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Jiangsu, Peoples R China

[3] Univ Cent Arkansas, Dept Econ Finance Insurance & Risk Management, Conway, AR 72035 USA

来源：

IEEE ACCESS | 2019年 / 7卷

关键词：

Calculation; CNN; DCGAN; gesture recognition; text output;

D O I：

10.1109/ACCESS.2019.2901930

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the past few years, with the continuous improvement of hardware conditions, deep learning had performed well in solving many problems, such as visual recognition, speech recognition, and natural language processing. In recent years, human-computer interaction behavior has appeared more and more in daily life. Especially with the rapid development of computer vision technology, the human-centered human-computer interaction technology is bound to replace computer-centered human-computer interaction technology. The study of gesture recognition is in line with this trend, and gesture recognition provides a way for many devices to interact with humans. The traditional gesture recognition method requires manual extraction of feature values, which is a time-consuming and laborious method. In order to break through the bottleneck, we propose a new gesture recognition algorithm based on the convolutional neural network and deep convolution generative adversarial networks. We apply this method to expression recognition, calculation, and text output, and achieve good results. The experiments show that the proposed method can train the model to identify with fewer samples and achieve better gesture classification and detection effects. Moreover, this gesture recognition method is less susceptible to illumination and background interference. It also can achieve an efficient real-time recognition effect.

引用

页码：28230 / 28237

页数：8

共 18 条

[11] Gradient-based learning applied to document recognition [J].

Lecun, Y ;

Bottou, L ;

Bengio, Y ;

Haffner, P .

PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324

[12] A Fusion Steganographic Algorithm Based on Faster R-CNN [J].

Meng, Ruohan ;

Rice, Steven G. ;

Wang, Jin ;

Sun, Xingming .

CMC-COMPUTERS MATERIALS & CONTINUA, 2018, 55 (01) :1-16

[13] KinectFusion: Real-Time Dense Surface Mapping and Tracking [J].

Newcombe, Richard A. ;

Izadi, Shahram ;

Hilliges, Otmar ;

Molyneaux, David ;

Kim, David ;

Davison, Andrew J. ;

Kohli, Pushmeet ;

Shotton, Jamie ;

Hodges, Steve ;

Fitzgibbon, Andrew .

2011 10TH IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR), 2011, :127-136

[14]

Shin S, 2016, IEEE INT SYMP CIRC S, P2274, DOI 10.1109/ISCAS.2016.7539037

[15]

Takahashi T., 1991, SIGCHI Bulletin, V23, P67, DOI 10.1145/122488.122499

[16] Vision-Based Hand-Gesture Applications [J].

Wachs, Juan Pablo ;

Koelsch, Mathias ;

Stern, Helman ;

Edan, Yael .

COMMUNICATIONS OF THE ACM, 2011, 54 (02) :60-71

[17] Paragraph Vector Representation Based on Word to Vector and CNN Learning [J].

Xiong, Zeyu ;

Shen, Qiangqiang ;

Wang, Yijie ;

Zhu, Chenyang .

CMC-COMPUTERS MATERIALS & CONTINUA, 2018, 55 (02) :213-227

[18] Visualizing and Understanding Convolutional Networks [J].

Zeiler, Matthew D. ;

Fergus, Rob .

COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :818-833

← 1 2 →