On Urinary Bladder Cancer Diagnosis: Utilization of Deep Convolutional Generative Adversarial Networks for Data Augmentation

被引:16
作者
Lorencin, Ivan [1 ]
Baressi Segota, Sandi [1 ]
Andelic, Nikola [1 ]
Mrzljak, Vedran [1 ]
Cabov, Tomislav [2 ]
Spanjol, Josip [3 ,4 ]
Car, Zlatan [1 ]
机构
[1] Univ Rijeka, Fac Engn, Vukovarska 58, Rijeka 51000, Croatia
[2] Univ Rijeka, Fac Med Dent, Kresimirova 40-4220-1, Rijeka 51000, Croatia
[3] Univ Rijeka, Fac Med, Ul Brace Branchetta 20-1, Rijeka 51000, Croatia
[4] Clin Hosp Ctr, Rijeka 51000, Croatia
来源
BIOLOGY-BASEL | 2021年 / 10卷 / 03期
关键词
AlexNet; data augmentation; deep convolutional generative adversarial networks; urinary bladder cancer; VGG16; SQUAMOUS-CELL CARCINOMA; ADENOCARCINOMA;
D O I
10.3390/biology10030175
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Simple Summary One of the main challenges in the application of Machine Learning in medicine is data collection. Either due to ethical concerns or lack of patients, data may be scarce. In this paper Deep Convolutional Generative Adversarial Networks (DCGAN) have been applied for the purpose of data augmentation. Images of bladder mucosa are used in order to generate new images using DCGANs. Then, combination of original and generated images are used to train AlexNet and VGG16 architectures. The results show improvements in AUC score in some cases, or equal scores with apparent lowering of standard deviation across data folds during cross-validation; indicating networks trained with the addition of generated data have a lower sensitivity across the hyperparameter range. Urinary bladder cancer is one of the most common urinary tract cancers. Standard diagnosis procedure can be invasive and time-consuming. For these reasons, procedure called optical biopsy is introduced. This procedure allows in-vivo evaluation of bladder mucosa without the need for biopsy. Although less invasive and faster, accuracy is often lower. For this reason, machine learning (ML) algorithms are used to increase its accuracy. The issue with ML algorithms is their sensitivity to the amount of input data. In medicine, collection can be time-consuming due to a potentially low number of patients. For these reasons, data augmentation is performed, usually through a series of geometric variations of original images. While such images improve classification performance, the number of new data points and the insight they provide is limited. These issues are a motivation for the application of novel augmentation methods. Authors demonstrate the use of Deep Convolutional Generative Adversarial Networks (DCGAN) for the generation of images. Augmented datasets used for training of commonly used Convolutional Neural Network-based (CNN) architectures (AlexNet and VGG-16) show a significcan performance increase for AlexNet, where AUCmicro reaches values up to 0.99. Average and median results of networks used in grid-search increases. These results point towards the conclusion that GAN-based augmentation has decreased the networks sensitivity to hyperparemeter change.
引用
收藏
页码:1 / 27
页数:27
相关论文
共 50 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2015, TIME FREQUENCY SIGNA
[3]  
[Anonymous], 2014, INT C LEARN REPRESEN
[4]  
[Anonymous], 2016, Distill, DOI DOI 10.23915/DISTILL.00003
[5]  
Bishop C.M., 2006, Pattern Recognition and Machine Learning, DOI DOI 10.1007/978-0-387-45528-0
[6]  
Bogovic K., 2018, IN TECH 2018
[7]  
Car Z., 2020, ENABLING APPL DATA S, P225
[8]  
Celis JE, 2000, ELECTROPHORESIS, V21, P2115
[9]   Generative Adversarial Networks An overview [J].
Creswell, Antonia ;
White, Tom ;
Dumoulin, Vincent ;
Arulkumaran, Kai ;
Sengupta, Biswa ;
Bharath, Anil A. .
IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (01) :53-65
[10]  
Dadhania V, 2015, AM J CLIN EXP UROL, V3, P51