Data augmentation using MG-GAN for improved cancer classification on gene expression data

Cited by: 0
Authors
Poonam Chaudhari
Himanshu Agrawal
Ketan Kotecha
Affiliations
[1] Gokhale Education Society’s R. H. Sapat College of Engineering, Management Studies and Research
[2] Symbiosis Institute of Technology
Source
Soft Computing | 2020 / Vol. 24
Keywords
Data augmentation; Generative adversarial network; Gene expression dataset; Cancer detection; Modified generator GAN; Multivariate noise; Gaussian distribution; Latent space; Saddle point
Abstract
Molecular biology studies on cancer that use gene expression datasets have revealed that these datasets contain very few samples. Obtaining medical data is difficult and expensive due to privacy constraints. The accuracy of classifiers depends greatly on the quality and quantity of input data. The problem of small sample size has been addressed by augmentation. Owing to the sensitivity of synthetic data samples in cancer classification on gene expression data, this paper investigates data augmentation using a generative adversarial network (GAN). A GAN is based on the principle of two blocks (generator and discriminator) working in a collaborative yet adversarial way. This paper proposes the modified generator GAN (MG-GAN), in which the generator is fed with original data and multivariate noise to generate data with a Gaussian distribution. As the generated data lie within the latent space, the saddle point is reached faster. GANs have been widely used in data augmentation for image datasets. To our knowledge, this is the first attempt to use a GAN for augmentation on a gene expression dataset. The performance of the proposed MG-GAN was compared with KNN and a basic GAN. Compared with KNN and GAN, MG-GAN improves classification accuracy by 18.8% and 11.9%, respectively. The loss value of the error function for MG-GAN is drastically reduced, from 0.6978 to 0.0082, ensuring sensitivity of the generated data. Improved classification accuracy and the reduction in the loss value make our improved MG-GAN method better suited for critical applications with sensitive data.
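The abstract states that the MG-GAN generator is fed with original data together with multivariate noise, but it does not say how the two are combined. The following is a minimal sketch of one possible input construction, not the authors' implementation. The helper name `make_generator_input`, the latent dimension of 16, the identity covariance, and the choice to concatenate real samples with noise are all assumptions made for illustration. The generator network, the discriminator, and the adversarial training loop are not shown.

```python
import numpy as np

def make_generator_input(real, n_samples, latent_dim=16, seed=0):
    """Hypothetical helper: pair real expression profiles with Gaussian noise.

    real: array of shape (n_real_samples, n_genes).
    Returns an array of shape (n_samples, n_genes + latent_dim).
    """
    rng = np.random.default_rng(seed)
    # Draw real samples with replacement, since gene expression datasets are small.
    idx = rng.integers(0, real.shape[0], size=n_samples)
    # Multivariate Gaussian noise; zero mean and identity covariance are assumptions.
    noise = rng.multivariate_normal(
        mean=np.zeros(latent_dim), cov=np.eye(latent_dim), size=n_samples
    )
    # Concatenation is an assumed way of feeding both inputs to the generator.
    return np.concatenate([real[idx], noise], axis=1)

# Toy stand-in for a gene expression matrix: 20 samples, 100 genes.
real = np.random.default_rng(1).normal(size=(20, 100))
batch = make_generator_input(real, n_samples=8)
print(batch.shape)  # (8, 116)
```

Each row of `batch` carries one real profile in its first 100 columns and a fresh noise vector in the remaining 16, so a generator consuming it would see both sources of information at once.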
Pages: 11381-11391
Page count: 10