Data Generation Using Gene Expression Generator

被引:7
作者
Farou, Zakarya [1 ]
Mouhoub, Noureddine [2 ]
Horvath, Tomas [1 ,3 ]
机构
[1] Eotvos Lorand Univ, Fac Informat, Telekom Innovat Labs, Dept Data Sci & Engn, Pazmany Peter Setany 1-C, H-1117 Budapest, Hungary
[2] Univ Bordeaux, Bordeaux Comp Sci Lab LaBRI, Sci & Technol Campus,351 Cours Liberat, F-33400 Talence, France
[3] Pavol Jozef Safarik Univ, Inst Comp Sci, Jesenna 5, Kosice 04001, Slovakia
来源
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2020, PT II | 2020年 / 12490卷
关键词
Data generation; Generative adversarial networks; Gene expression data; Cancer classification; CLASSIFICATION;
D O I
10.1007/978-3-030-62365-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative adversarial networks (GANs) could be used efficiently for image and video generation when labeled training data is available in bulk. In general, building a good machine learning model requires a reasonable amount of labeled training data. However, there are areas such as the biomedical field where the creation of such a dataset is time-consuming and requires expert knowledge. Thus, the aim is to use data augmentation techniques as an alternative to data collection to improve data classification. This paper presents the use of a modified version of a GAN called Gene Expression Generator (GEG) to augment the available data samples. The proposed approach was used to generate synthetic data for binary biomedical datasets to train existing supervised machine learning approaches. Experimental results show that the use of GEG for data augmentation with amodified version of leave one out cross-validation (LOOCV) increases the performance of classification accuracy.
引用
收藏
页码:54 / 65
页数:12
相关论文
共 31 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]  
[Anonymous], 2012, [44] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," in Advances in neural information processing systems, 2012, pp. 1097-1105., P1097
[3]  
[Anonymous], RUTGERS STATE U NEW
[4]  
Antipov G, 2017, IEEE IMAGE PROC, P2089, DOI 10.1109/ICIP.2017.8296650
[5]  
Berthelot D, 2020, Arxiv, DOI arXiv:2003.02365
[6]   Classification of gene expression data: A hubness-aware semi-supervised approach [J].
Buza, Krisztian .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2016, 127 :105-113
[7]   Advanced Customer Activity Prediction based on Deep Hierarchic Encoder-Decoders [J].
Damian, Andrei Ionut ;
Tapus, Nicolae ;
Piciu, Laurentiu ;
Turlea, Sergiu .
2019 22ND INTERNATIONAL CONFERENCE ON CONTROL SYSTEMS AND COMPUTER SCIENCE (CSCS), 2019, :403-409
[8]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[9]   Image-to-Image Translation with Conditional Adversarial Networks [J].
Isola, Phillip ;
Zhu, Jun-Yan ;
Zhou, Tinghui ;
Efros, Alexei A. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5967-5976
[10]  
Im DJ, 2016, Arxiv, DOI arXiv:1602.05110