Data augmentation for handwritten digit recognition using generative adversarial networks

被引:14
作者
Jha, Ganesh [1 ]
Cecotti, Hubert [1 ]
机构
[1] Calif State Univ Fresno Fresno State, Coll Sci & Math, Dept Comp Sci, 2576 E San Ramon MS ST 109, Fresno, CA 93740 USA
关键词
Machine learning; Neural networks; Classification; Generative adversarial networks; CHARACTER-RECOGNITION;
D O I
10.1007/s11042-020-08883-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Supervised learning techniques require labeled examples that can be time consuming to obtain. In particular, deep learning approaches, where all the feature extraction stages are learned within the artificial neural network, require a large number of labeled examples to train the model. Various data augmentation techniques can be performed to overcome this issue by taking advantage of known variations that have no impact on the label of an example. Typical solutions in computer vision and document analysis and recognition are based on geometric transformations (e.g. shift and rotation) and random elastic deformations of the original training examples. In this paper, we consider Generative Adversarial Networks (GAN), a technique that does not require prior knowledge of the possible variabilities that exist across examples to create novel artificial examples. In the case of a training dataset with a low number of labeled examples, which are described in a high dimensional space, the classifier may generalize poorly. Therefore, we aim at enriching databases of images or signals for improving the classifier performance by designing a GAN for creating artificial images. While adding more images through a GAN can help, the extent to which it will help is unknown, and it may degrade the performance if too many artificial images are added. The approach is tested on four datasets on handwritten digits (Latin, Bangla, Devanagri, and Oriya). The accuracy for each dataset shows that the addition of GAN generated images in the training dataset provides an improvement of the accuracy. However, the results suggest that the addition of too many GAN generated images deteriorates the performance.
引用
收藏
页码:35055 / 35068
页数:14
相关论文
共 50 条
  • [31] Seismic Data Augmentation Based on Conditional Generative Adversarial Networks
    Li, Yuanming
    Ku, Bonhwa
    Zhang, Shou
    Ahn, Jae-Kwang
    Ko, Hanseok
    SENSORS, 2020, 20 (23) : 1 - 13
  • [32] Efficient Classification of Imbalanced Natural Disasters Data Using Generative Adversarial Networks for Data Augmentation
    Eltehewy, Rokaya
    Abouelfarag, Ahmed
    Saleh, Sherine Nagy
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2023, 12 (06)
  • [33] Data augmentation through multivariate scenario forecasting in Data Centers using Generative Adversarial Networks
    Jaime Pérez
    Patricia Arroba
    José M. Moya
    Applied Intelligence, 2023, 53 : 1469 - 1486
  • [34] Assisting Barrett's esophagus identification using endoscopic data augmentation based on Generative Adversarial Networks
    de Souza Jr, Luis A.
    Passos, Leandro A.
    Mendel, Robert
    Ebigbo, Alanna
    Probst, Andreas
    Messmann, Helmut
    Palm, Christoph
    Papa, Joao P.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 126
  • [35] The use of generative adversarial networks in medical image augmentation
    Makhlouf, Ahmed
    Maayah, Marina
    Abughanam, Nada
    Catal, Cagatay
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (34) : 24055 - 24068
  • [36] Urdu Handwritten Ligature Generation Using Generative Adversarial Networks (GANs)
    Sharif, Marium
    Ul-Hasan, Adnan
    Shafait, Faisal
    FRONTIERS IN HANDWRITING RECOGNITION, ICFHR 2022, 2022, 13639 : 421 - 435
  • [37] Augmentation of a Virtual Reality Environment Using Generative Adversarial Networks
    Franchi, Valerio
    Ntagiou, Evridiki
    2021 4TH IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY (AIVR 2021), 2021, : 219 - 223
  • [38] Population-scale Genomic Data Augmentation Based on Conditional Generative Adversarial Networks
    Chen, Junjie
    Mowlaei, Mohammad Erfan
    Shi, Xinghua
    ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [39] Improved Generative Adversarial Networks With Filtering Mechanism for Fault Data Augmentation
    Shao, Lexuan
    Lu, Ningyun
    Jiang, Bin
    Simani, Silvio
    Song, Le
    Liu, Zhengyuan
    IEEE SENSORS JOURNAL, 2023, 23 (13) : 15176 - 15187
  • [40] Data augmentation for patch-based OCT chorio-retinal segmentation using generative adversarial networks
    Jason Kugelman
    David Alonso-Caneiro
    Scott A. Read
    Stephen J. Vincent
    Fred K. Chen
    Michael J. Collins
    Neural Computing and Applications, 2021, 33 : 7393 - 7408