ROI-based feature learning for efficient true positive prediction using convolutional neural network for lung cancer diagnosis

被引:45
作者
Suresh, Supriya [1 ]
Mohan, Subaji [2 ]
机构
[1] Vellore Inst Technol, Sch Engn & Comp Sci, Vellore, Tamil Nadu, India
[2] Vellore Inst Technol, Inst Ind & Int Programmes, Vellore, Tamil Nadu, India
关键词
Convolutional neural network; Deep structured algorithm; Batch normalization; ReLu; Max-pooling; Softmax; Benign and malignant; Morphological features; PULMONARY NODULE DETECTION; COMPUTER-AIDED DIAGNOSIS; CT IMAGES; CLASSIFICATION; SYSTEM; ALGORITHMS; REDUCTION;
D O I
10.1007/s00521-020-04787-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural network (CNN) is one of the deep structured algorithms widely applied to analyze the ability to visualize and extract the hidden texture features of image datasets. The study aims to automatically extract the self-learned features using an end-to-end learning CNN and compares the results with the conventional state-of-art and traditional computer-aided diagnosis system's performance. The architecture consists of eight layers: one input layer, three convolutional layers and three sub-sampling layers intercepted with batch normalization, ReLu and max-pooling for salient feature extraction, and one fully connected layer that uses softmax function connected to 3 neurons as output layer, classifying an input image into one of three classes categorized as nodules >= 3 mm as benign (low malignancy nodules), malignant (high malignancy nodules), and nodules < 3 mm and non-nodules >= 3 mm combined as non-cancerous. For the input layer, lung nodule CT images are acquired from the Lung Image Database Consortium public repository having 1018 cases. Images are pre-processed to uniquely segment the nodule region of interest (NROI) in correspondence to four radiologists' annotations and markings describing the coordinates and ground-truth values. A two-dimensional set of re-sampled images of size 52 x 52 pixels with random translation, rotation, and scaling corresponding to the NROI are generated as input samples. In addition, generative adversarial networks (GANs) are employed to generate additional images with similar characteristics as pulmonary nodules. CNNs are trained using images generated by GAN and are fine-tuned with actual input samples to differentiate and classify the lung nodules based on the classification strategy. The pre-trained and fine-tuned process upon the trained network's architecture results in aggregate probability scores for nodule detection reducing false positives. A total of 5188 images with an augmented image data store are used to enhance the performance of the network in the study generating high sensitivity scores with good true positives. Our proposed CNN achieved the classification accuracy of 93.9%, an average specificity of 93%, and an average sensitivity of 93.4% with reduced false positives and evaluated the area under the receiver operating characteristic curve with the highest observed value of 0.934 using the GAN generated images.
引用
收藏
页码:15989 / 16009
页数:21
相关论文
共 58 条
  • [2] Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening
    Aberle, Denise R.
    Adams, Amanda M.
    Berg, Christine D.
    Black, William C.
    Clapp, Jonathan D.
    Fagerstrom, Richard M.
    Gareen, Ilana F.
    Gatsonis, Constantine
    Marcus, Pamela M.
    Sicks, JoRean D.
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2011, 365 (05) : 395 - 409
  • [3] Al-Fahoum A.S., 2014, J Biomed Graph Comput, V4, P33, DOI [DOI 10.5430/JBGC.V4N2P33, 10.5430/jbgc.v4n2p33]
  • [4] Amer HM, 2018, P 7 INT C SOFTW INF, P81
  • [5] [Anonymous], 2019, Lung cancer fact sheet
  • [6] Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm
    Arabasadi, Zeinab
    Alizadehsani, Roohallah
    Roshanzamir, Mohamad
    Moosaei, Hossein
    Yarifard, Ali Asghar
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 141 : 19 - 26
  • [7] End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography
    Ardila, Diego
    Kiraly, Atilla P.
    Bharadwaj, Sujeeth
    Choi, Bokyung
    Reicher, Joshua J.
    Peng, Lily
    Tse, Daniel
    Etemadi, Mozziyar
    Ye, Wenxing
    Corrado, Greg
    Naidich, David P.
    Shetty, Shravya
    [J]. NATURE MEDICINE, 2019, 25 (06) : 954 - +
  • [8] The Lung Image Database Consortium, (LIDC) and Image Database Resource Initiative (IDRI): A Completed Reference Database of Lung Nodules on CT Scans
    Armato, Samuel G., III
    McLennan, Geoffrey
    Bidaut, Luc
    McNitt-Gray, Michael F.
    Meyer, Charles R.
    Reeves, Anthony P.
    Zhao, Binsheng
    Aberle, Denise R.
    Henschke, Claudia I.
    Hoffman, Eric A.
    Kazerooni, Ella A.
    MacMahon, Heber
    van Beek, Edwin J. R.
    Yankelevitz, David
    Biancardi, Alberto M.
    Bland, Peyton H.
    Brown, Matthew S.
    Engelmann, Roger M.
    Laderach, Gary E.
    Max, Daniel
    Pais, Richard C.
    Qing, David P-Y
    Roberts, Rachael Y.
    Smith, Amanda R.
    Starkey, Adam
    Batra, Poonam
    Caligiuri, Philip
    Farooqi, Ali
    Gladish, Gregory W.
    Jude, C. Matilda
    Munden, Reginald F.
    Petkovska, Iva
    Quint, Leslie E.
    Schwartz, Lawrence H.
    Sundaram, Baskaran
    Dodd, Lori E.
    Fenimore, Charles
    Gur, David
    Petrick, Nicholas
    Freymann, John
    Kirby, Justin
    Hughes, Brian
    Casteele, Alessi Vande
    Gupte, Sangeeta
    Sallam, Maha
    Heath, Michael D.
    Kuhn, Michael H.
    Dharaiya, Ekta
    Burns, Richard
    Fryd, David S.
    [J]. MEDICAL PHYSICS, 2011, 38 (02) : 915 - 931
  • [9] Arulmurugan R, 2018, L N COMPUT VIS BIOME, V28, P103, DOI 10.1007/978-3-319-71767-8_9
  • [10] Detection of Cancer in Lung With K-NN Classification Using Genetic Algorithm
    Bhuvaneswari, P.
    Therese, A. Brintha
    [J]. 2ND INTERNATIONAL CONFERENCE ON NANOMATERIALS AND TECHNOLOGIES (CNT 2014), 2015, 10 : 433 - 440