WGAN-Based Synthetic Minority Over-Sampling Technique: Improving Semantic Fine-Grained Classification for Lung Nodules in CT Images

Cited by: 67
Authors
Wang, Qingfeng [1, 2]
Zhou, Xuehai [1 ]
Wang, Chao [1 ]
Liu, Zhiqin [2 ]
Huang, Jun [2 ]
Zhou, Ying [3 ]
Li, Changlong [1 ]
Zhuang, Hang [1 ]
Cheng, Jie-Zhi [4 ]
Affiliations
[1] Univ Sci & Technol China, Sch Software Engn, Hefei 230026, Anhui, Peoples R China
[2] Southwest Univ Sci & Technol, Sch Comp Sci & Technol, Mianyang 621010, Peoples R China
[3] Mianyang Cent Hosp, Radiol Dept, Mianyang 621000, Peoples R China
[4] Shanghai United Imaging Intelligence Co Ltd, Shanghai 200232, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Computer-aided diagnosis (CAD); lung nodule; computed tomography (CT); synthetic minority over-sampling; deep learning; data imbalance; adversarial neural networks; DATABASE CONSORTIUM LIDC; REDUCTION; TEXT;
DOI
10.1109/ACCESS.2019.2896409
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Data imbalance is present in most medical image analysis problems and may be growing in importance with the popularization of data-hungry deep learning paradigms. We explore Wasserstein generative adversarial networks (WGANs) to address the data imbalance problem by over-sampling the minority classes. The WGAN can estimate the underlying distribution of a minority class and synthesize more plausible and helpful samples for the classification model. In this paper, the WGAN-based over-sampling technique is applied to augment and balance the data for the fine-grained classification of seven semantic attributes of lung nodules in computed tomography images. The fine-grained classification is carried out with a standard convolutional neural network (CNN). To further illustrate the efficacy of the WGAN-based over-sampling technique, the conventional data augmentation method commonly used in many deep learning works, generative adversarial networks (GANs), and deep convolutional generative adversarial networks (DCGANs) are implemented for comparison. The complete schemes of minority over-sampling and fine-grained classification are tested on the public Lung Image Database Consortium (LIDC) dataset. The experimental results suggest that the WGAN-based over-sampling technique can synthesize helpful samples for the minority classes to assist the training of the CNN model, and that it boosts fine-grained classification performance more than the conventional data augmentation method and the GAN and DCGAN schemes do. This suggests that the WGAN technique offers an alternative methodological option for further deep learning studies on imbalanced classification.
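The over-sampling idea in the abstract can be sketched in a few lines. The abstract does not say how many samples are synthesized per class or which WGAN variant is trained, so this sketch assumes that every class is topped up to the size of the largest class and that the losses follow the original weight-clipped WGAN formulation. All function names, the clipping bound, and the label counts are hypothetical and chosen only for illustration.

```python
from collections import Counter
from statistics import fmean


def synthetic_counts(labels):
    """Number of synthetic samples each class needs to match the largest class."""
    counts = Counter(labels)
    target = max(counts.values())
    return {cls: target - n for cls, n in counts.items()}


def critic_loss(real_scores, fake_scores):
    """WGAN critic loss: mean score on generated samples minus mean on real ones.

    Minimizing it pushes real scores up and generated scores down; its negation
    is the critic's current estimate of the Wasserstein distance.
    """
    return fmean(fake_scores) - fmean(real_scores)


def generator_loss(fake_scores):
    """WGAN generator loss: negated mean critic score on generated samples."""
    return -fmean(fake_scores)


def clip_weights(weights, c=0.01):
    """Clip critic weights to [-c, c], as in the original WGAN (assumed variant)."""
    return [max(-c, min(c, w)) for w in weights]


# Hypothetical ratings of one semantic attribute (counts invented for illustration).
labels = [1] * 50 + [2] * 20 + [3] * 5
needed = synthetic_counts(labels)
print(needed)
```

In a full pipeline, one generator would presumably be trained per minority class and sampled `needed[cls]` times before the CNN classifier is trained on the balanced set; that wiring is an assumption here, not a detail taken from the record.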
Pages: 18450-18463
Number of pages: 14