Gesture Recognition Based on CNN and DCGAN for Calculation and Text Output

被引:42
作者
Fang, Wei [1 ,2 ]
Ding, Yewen [1 ]
Zhang, Feihong [1 ]
Sheng, Jack [3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Jiangsu Engn Ctr Network Monitoring, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Jiangsu, Peoples R China
[3] Univ Cent Arkansas, Dept Econ Finance Insurance & Risk Management, Conway, AR 72035 USA
关键词
Calculation; CNN; DCGAN; gesture recognition; text output;
D O I
10.1109/ACCESS.2019.2901930
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the past few years, with the continuous improvement of hardware conditions, deep learning had performed well in solving many problems, such as visual recognition, speech recognition, and natural language processing. In recent years, human-computer interaction behavior has appeared more and more in daily life. Especially with the rapid development of computer vision technology, the human-centered human-computer interaction technology is bound to replace computer-centered human-computer interaction technology. The study of gesture recognition is in line with this trend, and gesture recognition provides a way for many devices to interact with humans. The traditional gesture recognition method requires manual extraction of feature values, which is a time-consuming and laborious method. In order to break through the bottleneck, we propose a new gesture recognition algorithm based on the convolutional neural network and deep convolution generative adversarial networks. We apply this method to expression recognition, calculation, and text output, and achieve good results. The experiments show that the proposed method can train the model to identify with fewer samples and achieve better gesture classification and detection effects. Moreover, this gesture recognition method is less susceptible to illumination and background interference. It also can achieve an efficient real-time recognition effect.
引用
收藏
页码:28230 / 28237
页数:8
相关论文
共 18 条
[1]  
[Anonymous], 2016, ICLR SAN JUAN PR US
[2]  
[Anonymous], PROC CVPR IEEE
[3]  
[Anonymous], 2012, NIPS LAK TAH NV US D
[4]  
[Anonymous], 2015, ICLR SAN DIEG CA US
[5]  
[Anonymous], CVPR BOST MA US JUN
[6]   A Method for Improving CNN-Based Image Recognition Using DCGAN [J].
Fang, Wei ;
Zhang, Feihong ;
Sheng, Victor S. ;
Ding, Yewen .
CMC-COMPUTERS MATERIALS & CONTINUA, 2018, 57 (01) :167-178
[7]   NEOCOGNITRON - A NEURAL NETWORK MODEL FOR A MECHANISM OF VISUAL-PATTERN RECOGNITION [J].
FUKUSHIMA, K ;
MIYAKE, S ;
ITO, T .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :826-834
[8]  
Goodfellow I, 2014, NIPS, V27, DOI DOI 10.1145/3422622
[9]   RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE OF MONKEY STRIATE CORTEX [J].
HUBEL, DH ;
WIESEL, TN .
JOURNAL OF PHYSIOLOGY-LONDON, 1968, 195 (01) :215-&
[10]  
JongShill Lee, 2004, Conference Proceedings. 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No.04CH37558), P1513