Efficient Multiple Organ Localization in CT Image Using 3D Region Proposal Network

Cited by: 92
Authors
Xu, Xuanang [1]
Zhou, Fugen [1, 2]
Liu, Bo [1, 2]
Fu, Dongshan [2]
Bai, Xiangzhi [1, 2]
Affiliations
[1] Beihang Univ, Image Proc Ctr, Sch Astronaut, Beijing 100191, Peoples R China
[2] Beihang Univ, Beijing Adv Innovat Ctr Biomed Engn, Beijing 100191, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Organ localization; CT image; convolutional neural network; region proposal network; MULTIORGAN LOCALIZATION; ANATOMICAL STRUCTURES; REGRESSION FORESTS;
DOI
10.1109/TMI.2019.2894854
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Organ localization is an essential preprocessing step for many medical image analysis tasks, such as image registration, organ segmentation, and lesion detection. In this paper, we propose an efficient method for multiple organ localization in CT images using a 3D region proposal network. Compared with other convolutional neural network-based methods that successively detect the target organs in all slices to assemble the final 3D bounding box, our method is fully implemented in a 3D manner, and thus it can take full advantage of the spatial context information in CT images to perform efficient organ localization with only one prediction. We also propose a novel backbone network architecture that generates high-resolution feature maps to further improve the localization performance on small organs. We evaluate our method on two clinical datasets, where 11 body organs and 12 head organs (or anatomical structures) are included. As our results show, the proposed method achieves higher detection precision and localization accuracy than the current state-of-the-art methods, with approximately 4 to 18 times faster processing speed. Additionally, we have established a public dataset dedicated to organ localization at http://dx.doi.org/10.21227/df8g-pq27. The full implementation of the proposed method has also been made publicly available at https://github.com/superxuang/caffe_3d_faster_rcnn.
Pages: 1885-1898
Page count: 14