Enhancing Automated COVID-19 Chest X-ray Diagnosis by Image-to-Image GAN Translation

被引:10
作者
Liang, Zhaohui [1 ]
Huang, Jimmy Xiangji [2 ]
Li, Jun [3 ]
Chan, Stephen [4 ]
机构
[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada
[2] York Univ, Sch Informat Technol, Toronto, ON, Canada
[3] Guangzhou Univ Chinese Med, Guangdong Prov Hosp Chinese Med, Guangzhou, Peoples R China
[4] Dapasoft INC, Toronto, ON, Canada
来源
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE | 2020年
关键词
COVID-19; generative adversarial network; GAN; image classification; deep learning;
D O I
10.1109/BIBM49941.2020.9313466
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The severe pneumonia induced by the infection of the SARS-CoV-2 virus causes massive death in the ongoing COVID-19 pandemic. The early detection of the SARS-CoV-2 induced pneumonia relies on the unique patterns of the chest X-Ray images. Deep learning is a data-greedy algorithm to achieve high performance when adequately trained. A common challenge for machine learning in the medical domain is the accessibility to properly annotated data. In this study, we apply a conditional adversarial network (cGAN) to perform image to image (Pix2Pix) translation from the non-COVID-19 chest X-Ray domain to the COVID-19 chest X-Ray domain. The objective is to learn a mapping from the normal chest X-Ray visual patterns to the COVID-19 pneumonia chest X-ray patterns. The original dataset has a typical imbalanced issue because it contains only 219 COVID-19 positive images but has 1,341 images for normal chest X-Ray and 1,345 images for viral pneumonia. A U-Net based architecture is applied for the image-to-image translation to generate synthesized COVID-19 X-Ray chest images from the normal chest X-ray images. A 50-convolutional-layer residual net (ResNet) architecture is applied for the final classification task. After training the GAN model for 100 epochs, we use the GAN generator to translate 1,100 COVID-19 images from the normal X-Ray to form a balanced training dataset (3,762 images) for the classification task. The ResNet based classifier trained by the enhanced dataset achieves the classification accuracy of 97.8% compared to 96.1% in the transfer learning mode. When trained with the original imbalanced dataset, the model achieves an accuracy of 96.1% compared to 95.6% in the training from trainby-scratch model. In addition, the classifier trained by the enhanced dataset has more stable measures in precision, recall, and F1 scores across different image classes. We conclude that the GAN-based data enhancement strategy is applicable to most medical image pattern recognition tasks, and it provides an effective way to solve the common expertise dependence issue in the medical domain.
引用
收藏
页码:1068 / 1071
页数:4
相关论文
共 14 条
[1]   A Survey on Deep Transfer Learning to Edge Computing for Mitigating the COVID-19 Pandemic [J].
Abu Sufian ;
Ghosh, Anirudha ;
Sadiq, Ali Safaa ;
Smarandache, Florentin .
JOURNAL OF SYSTEMS ARCHITECTURE, 2020, 108
[2]  
[Anonymous], 2014, 27THINT C NEURAL INF
[3]   Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study [J].
Chen, Nanshan ;
Zhou, Min ;
Dong, Xuan ;
Qu, Jieming ;
Gong, Fengyun ;
Han, Yang ;
Qiu, Yang ;
Wang, Jingli ;
Liu, Ying ;
Wei, Yuan ;
Xia, Jia'an ;
Yu, Ting ;
Zhang, Xinxin ;
Zhang, Li .
LANCET, 2020, 395 (10223) :507-513
[4]   Antibody tests for identification of current and past infection with SARS-CoV-2 [J].
Deeks, Jonathan J. ;
Dinnes, Jacqueline ;
Takwoingil, Yemisi ;
Davenport, Clare ;
Spijker, Ren ;
Taylor-Phillips, Sian ;
Adrianol, Ada ;
Beesel, Sophie ;
Dretzkel, Janine ;
di Ruffanol, Lavinia Ferrante ;
Harris, Isobel M. ;
Price, Malcolm J. ;
Dittrich, Sabine ;
Emperador, Devy ;
Hooft, Lotty ;
Leeflang, Mariska M. G. ;
Van den Bruel, Ann .
COCHRANE DATABASE OF SYSTEMATIC REVIEWS, 2020, (06)
[5]  
Gauthier JP, 2014, CONF P INDIUM PHOSPH
[6]   Real-time Automatic License Plate Recognition Through Deep Multi-Task Networks [J].
Goncalves, Gabriel R. ;
Diniz, Matheus A. ;
Laroca, Rayson ;
Menotti, David ;
Schwartz, William Robson .
PROCEEDINGS 2018 31ST SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2018, :110-117
[7]   Accurate Screening of COVID-19 Using Attention-Based Deep 3D Multiple Instance Learning [J].
Han, Zhongyi ;
Wei, Benzheng ;
Hong, Yanfei ;
Li, Tianyang ;
Cong, Jinyu ;
Zhu, Xue ;
Wei, Haifeng ;
Zhang, Wei .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (08) :2584-2594
[8]   Antiviral antibody responses: the two extremes of a wide spectrum [J].
Hangartner, L ;
Zinkernagel, RM ;
Hengartner, H .
NATURE REVIEWS IMMUNOLOGY, 2006, 6 (03) :231-243
[9]   Image-to-Image Translation with Conditional Adversarial Networks [J].
Isola, Phillip ;
Zhu, Jun-Yan ;
Zhou, Tinghui ;
Efros, Alexei A. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5967-5976
[10]   Conditional generative adversarial network for 3D rigid-body motion correction in MRI [J].
Johnson, Patricia M. ;
Drangova, Maria .
MAGNETIC RESONANCE IN MEDICINE, 2019, 82 (03) :901-910