Spatio-temporal context based recurrent visual attention model for lymph node detection

被引:2
作者
Peng, Haixin [1 ]
Peng, Yinjun [1 ,2 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Shandong Prov Key Lab Wisdom Min Informat Technol, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
biomedical image classification; false-positive reduction; mixture density networks; recurrent visual attention; CONVOLUTIONAL NEURAL-NETWORKS; AUTOMATIC DETECTION; SEGMENTATION; CNN;
D O I
10.1002/ima.22430
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
False-positive reduction is one of the most crucial components in an automated lymph nodes (LNs) detection task in volumetric computed tomography (CT) scans, which is a highly sought goal for cancer diagnosis and early treatment. In this article, treating the three-dimensional (3D) LN detection task as object detection on sequence problem, we propose a novel spatio-temporal context-based recurrent visual attention model (STRAM) for the LNs false positive reduction. We firstly extract the deep spatial features maps for two-dimensional LN patches from pre-trained Inception-V3 model. A new Gaussian kernel-based spatial attention method is then presented to extract the most discriminating spatial features for the corresponding center slices. Additionally, to combine the temporal information between 3D CT slices, we devise a novel "Siamese" mixture density networks which can learn to adaptively focus on the most relevant parts of the CT slices. Considering the lesion areas always locate around the centroid of the 3D CT scans, a hard constraint is imposed on the predicted attention locations with batch normalization technique and the Siamese architecture. The proposed model is a fully differentiable unit that can be optimized end-to-end by using stochastic gradient descent. The effectiveness of our method is verified on LN dataset: 388 mediastinal LNs labeled by radiologists in 90 patient CT scans, and 595 abdominal LNs in 86 patient CT scans. Our method demonstrates sensitivities of about 87%/82% at 3 FP/vol. and 93%/89% at 6 FP/vol. for mediastinum and abdomen, respectively, which compares favorably to previous methods.
引用
收藏
页码:1220 / 1242
页数:23
相关论文
共 55 条
  • [41] Improving Computer-Aided Detection Using Convolutional Neural Networks and Random View Aggregation
    Roth, Holger R.
    Lu, Le
    Liu, Jiamin
    Yao, Jianhua
    Seff, Ari
    Cherry, Kevin
    Kim, Lauren
    Summers, Ronald M.
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) : 1170 - 1181
  • [42] Leveraging Mid-Level Semantic Boundary Cues for Automated Lymph Node Detection
    Seff, Ari
    Lu, Le
    Barbu, Adrian
    Roth, Holger
    Shin, Hoo-Chang
    Summers, Ronald M.
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2015, PT II, 2015, 9350 : 53 - 61
  • [43] Seff A, 2014, LECT NOTES COMPUT SC, V8673, P544, DOI 10.1007/978-3-319-10404-1_68
  • [44] Pulmonary Nodule Detection in CT Images: False Positive Reduction Using Multi-View Convolutional Networks
    Setio, Arnaud Arindra Adiyoso
    Ciompi, Francesco
    Litjens, Geert
    Gerke, Paul
    Jacobs, Colin
    van Riel, Sarah J.
    Wille, Mathilde Marie Winkler
    Naqibullah, Matiullah
    Sanchez, Clara I.
    van Ginneken, Bram
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) : 1160 - 1169
  • [45] Sharma Shikhar, 2015, CoRR
  • [46] Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
    Shin, Hoo-Chang
    Roth, Holger R.
    Gao, Mingchen
    Lu, Le
    Xu, Ziyue
    Nogues, Isabella
    Yao, Jianhua
    Mollura, Daniel
    Summers, Ronald M.
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) : 1285 - 1298
  • [47] Sonderby S. K., 2015, ARXIV150905329
  • [48] Sutskever I, 2014, ADV NEUR IN, V27
  • [49] Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation
    Tseng, Kuan-Lun
    Lin, Yen-Liang
    Hsu, Winston
    Huang, Chung-Yang
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3739 - 3746
  • [50] Convolutional Networks Can Learn to Generate Affinity Graphs for Image Segmentation
    Turaga, Srinivas C.
    Murray, Joseph F.
    Jain, Viren
    Roth, Fabian
    Helmstaedter, Moritz
    Briggman, Kevin
    Denk, Winfried
    Seung, H. Sebastian
    [J]. NEURAL COMPUTATION, 2010, 22 (02) : 511 - 538