Data augmentation for handwritten digit recognition using generative adversarial networks

被引:1
作者
Ganesh Jha
Hubert Cecotti
机构
[1] California State University,Department of Computer Science, College of Science and Mathematics
[2] Fresno (Fresno State),undefined
来源
Multimedia Tools and Applications | 2020年 / 79卷
关键词
Machine learning; Neural networks; Classification; Generative adversarial networks;
D O I
暂无
中图分类号
学科分类号
摘要
Supervised learning techniques require labeled examples that can be time consuming to obtain. In particular, deep learning approaches, where all the feature extraction stages are learned within the artificial neural network, require a large number of labeled examples to train the model. Various data augmentation techniques can be performed to overcome this issue by taking advantage of known variations that have no impact on the label of an example. Typical solutions in computer vision and document analysis and recognition are based on geometric transformations (e.g. shift and rotation) and random elastic deformations of the original training examples. In this paper, we consider Generative Adversarial Networks (GAN), a technique that does not require prior knowledge of the possible variabilities that exist across examples to create novel artificial examples. In the case of a training dataset with a low number of labeled examples, which are described in a high dimensional space, the classifier may generalize poorly. Therefore, we aim at enriching databases of images or signals for improving the classifier performance by designing a GAN for creating artificial images. While adding more images through a GAN can help, the extent to which it will help is unknown, and it may degrade the performance if too many artificial images are added. The approach is tested on four datasets on handwritten digits (Latin, Bangla, Devanagri, and Oriya). The accuracy for each dataset shows that the addition of GAN generated images in the training dataset provides an improvement of the accuracy. However, the results suggest that the addition of too many GAN generated images deteriorates the performance.
引用
收藏
页码:35055 / 35068
页数:13
相关论文
共 55 条
[1]  
Belongie S(2002)Shape matching and object recognition using shape contexts IEEE Trans Pattern Anal Mach Intell 24 509-522
[2]  
Malik J(1998)A complete printed Bangla OCR system Pattern Recogn 31 531-549
[3]  
Puzicha J(2015)Rotation-invariant convolutional neural networks for galaxy morphology prediction Mon Not R Astron Soc 450 1441-1459
[4]  
Chaudhuri BB(2017)Dermatologist-level classification of skin cancer with deep neural networks Nature 542 115-118
[5]  
Pal U(2015)Handwritten marathi character recognition using R-HOG feature Procedia Comput Sci 45 266-274
[6]  
Dieleman S(2017)Deep neural network for handwritten marathi character recognition Int J Imag Robot 17 95-107
[7]  
Willett KW(2007)Deformation models for image recognition IEEE Trans Pattern Anal Machs Intell 29 1422-1435
[8]  
Dambre J(1998)Gradient-based learning applied to document recognition Proc IEEE 86 2278-2324
[9]  
Esteva A(2015)Deep learning Nature 521 436-444
[10]  
Kuprel B(2014)Deep learning of the tissue-regulated splicing code Bioinformatics 30 i121-i129