Thorax disease classification with attention guided convolutional neural network

被引:69
作者
Guan, Qingji [1 ,2 ]
Huang, Yaping [1 ]
Zhong, Zhun [2 ,3 ]
Zheng, Zhedong [2 ]
Zheng, Liang [2 ]
Yang, Yi [2 ]
机构
[1] Beijing Jiaotong Univ, Beijing Key Lab Traff Data Anal & Min, 3 Shangyuancun, Beijing 100044, Peoples R China
[2] Univ Technol Sydney, Ctr Artificial Intelligence, ReLER Lab, 15 Broadway, Sydnedy, NSW 2007, Australia
[3] Xiamen Univ, Dept Artifical Intelligence, Xiamen, Fujian, Peoples R China
关键词
CXR image classification; Visual attention; Feature ensemble;
D O I
10.1016/j.patrec.2019.11.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers the task of thorax disease diagnosis on chest X-ray (CXR) images. Most existing methods generally learn a network with global images as input. However, thorax diseases usually happen in (small) localized areas which are disease specific. Thus training CNNs using global images may be affected by the (excessive) irrelevant noisy areas. Besides, due to the poor alignment of some CXR images, the existence of irregular borders hinders the network performance. For addressing the above problems, we propose to integrate the global and local cues into a three-branch attention guided convolution neural network (AG-CNN) to identify thorax diseases. An attention guided mask inference based cropping strategy is proposed to avoid noise and improve alignment in the global branch. AG-CNN also integrates the global cues to compensate the lost discriminative cues by the local branch. Specifically, we first learn a global CNN branch using global images. Then, guided by the attention heatmap generated from the global branch, we infer a mask to crop a discriminative region from the global image. The local region is used for training a local CNN branch. Lastly, we concatenate the last pooling layers of both the global and local branches for fine-tuning the fusion branch. Experiments on the ChestX-ray14 dataset demonstrate that after integrating the local cues with the global information, the average AUC scores are improved by AG-CNN. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 32 条
  • [1] AMM: Towards adaptive ranking of multi-modal documents
    Akbari M.
    Nie L.
    Chua T.-S.
    [J]. International Journal of Multimedia Information Retrieval, 2015, 4 (4) : 233 - 245
  • [2] [Anonymous], 2007, P 6 ACM INT C IMAGE, DOI DOI 10.1145/1282280.1282352
  • [3] [Anonymous], 1997, Neural Computation
  • [4] [Anonymous], 2018, IB C PATT REC
  • [5] Medical Image Analysis using Convolutional Neural Networks: A Review
    Anwar, Syed Muhammad
    Majid, Muhammad
    Qayyum, Adnan
    Awais, Muhammad
    Alnowami, Majdi
    Khan, Muhammad Khurram
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2018, 42 (11)
  • [6] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [7] Segmentation and Quantification for Angle-Closure Glaucoma Assessment in Anterior Segment OCT
    Fu, Huazhu
    Xu, Yanwu
    Lin, Stephen
    Zhang, Xiaoqin
    Wong, Damon Wing Kee
    Liu, Jiang
    Frangi, Alejandro F.
    Baskaran, Mani
    Aung, Tin
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (09) : 1930 - 1938
  • [8] Multi-label chest X-ray image classification via category-wise residual attention learning
    Guan, Qingji
    Huang, Yaping
    [J]. PATTERN RECOGNITION LETTERS, 2020, 130 : 259 - 266
  • [9] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [10] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269