Prostate lesion segmentation based on a 3D end-to-end convolution neural network with deep multi-scale attention

被引:15
作者
Song, Enmin [1 ]
Long, Jiaosong [1 ]
Ma, Guangzhi [1 ]
Liu, Hong [1 ]
Hung, Chih-Cheng [2 ]
Jin, Renchao [1 ]
Wang, Peijun [3 ]
Wang, Wei [3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Kennesaw State Univ, Coll Comp & Software Engn, Atlanta, GA USA
[3] Tongji Univ, Tongji Hosp, Sch Medcine, Dept Radiol, Shanghai 200065, Peoples R China
基金
中国国家自然科学基金;
关键词
Mp-MRI; Prostate cancer segmentation; Convolution neural network; Attention; COMPUTER-AIDED DIAGNOSIS; SUPPORT VECTOR MACHINES; GLEASON SCORE; MR-IMAGES; CANCER;
D O I
10.1016/j.mri.2023.01.015
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Prostate cancer is one of the deadest cancers among human beings. To better diagnose the prostate cancer, prostate lesion segmentation becomes a very important work, but its progress is very slow due to the prostate lesions small in size, irregular in shape, and blurred in contour. Therefore, automatic prostate lesion segmentation from mp-MRI is a great significant work and a challenging task. However, the most existing multi-step segmentation methods based on voxel-level classification are time-consuming, may introduce errors in different steps and lead to error accumulation. To decrease the computation time, harness richer 3D spatial features, and fuse the multi-level contextual information of mp-MRI, we present an automatic segmentation method in which all steps are optimized conjointly as one step to form our end-to-end convolutional neural network. The proposed end-to-end network DMSA-V-Net consists of two parts: (1) a 3D V-Net is used as the backbone network, it is the first attempt in employing 3D convolutional neural network for CS prostate lesion segmentation, (2) a deep multi-scale attention mechanism is introduced into the 3D V-Net which can highly focus on the ROI while suppressing the redundant background. As a merit, the attention can adaptively re-align the context information between the feature maps at different scales and the saliency maps in high-levels. We performed experiments based on five cross-fold validation with data including 97 patients. The results show that the Dice and sensitivity are 0.7014 and 0.8652 respectively, which demonstrates that our segmentation approach is more significant and accurate compared to other methods.
引用
收藏
页码:98 / 109
页数:12
相关论文
共 50 条
[1]   Detecting prostate cancer using deep learning convolution neural network with transfer learning approach [J].
Abbasi, Adeel Ahmed ;
Hussain, Lal ;
Awan, Imtiaz Ahmed ;
Abbasi, Imran ;
Majid, Abdul ;
Nadeem, Malik Sajjad Ahmed ;
Chaudhary, Quratul-Ain .
COGNITIVE NEURODYNAMICS, 2020, 14 (04) :523-533
[2]  
[Anonymous], 2017, IEEE C COMPUTER VISI, DOI [10.1109/CVPR.2017.667, DOI 10.1109/CVPR.2017.667]
[3]  
[Anonymous], 2018, 24 INT C PATTERN REC, DOI [10.1109/ICPR.2018.8545754, DOI 10.1109/ICPR.2018.8545754]
[4]   Prostate Cancer Localization With Multispectral MRI Using Cost-Sensitive Support Vector Machines and Conditional Random Fields [J].
Artan, Yusuf ;
Haider, Masoom A. ;
Langer, Deanna L. ;
van der Kwast, Theodorus H. ;
Evans, Andrew J. ;
Yang, Yongyi ;
Wernick, Miles N. ;
Trachtenberg, John ;
Yetik, Imam Samil .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (09) :2444-2455
[5]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[6]  
Cao RM, 2019, I S BIOMED IMAGING, P1900, DOI [10.1109/ISBI.2019.8759584, 10.1109/isbi.2019.8759584]
[7]   Predicting Gleason Score of Prostate Cancer Patients Using Radiomic Analysis [J].
Chaddad, Ahmad ;
Niazi, Tamim ;
Probst, Stephan ;
Bladou, Franck ;
Anidjar, Maurice ;
Bahoric, Boris .
FRONTIERS IN ONCOLOGY, 2018, 8
[8]   Multimodal Radiomic Features for the Predicting Gleason Score of Prostate Cancer [J].
Chaddad, Ahmad ;
Kucharczyk, Michael J. ;
Niazi, Tamim .
CANCERS, 2018, 10 (08)
[9]   Detection of prostate cancer by integration of line-scan diffusion, T2-mapping and T2-weighted magnetic resonance imaging; a multichannel statistical classifier [J].
Chan, I ;
Wells, W ;
Mulkern, RV ;
Haker, S ;
Zhang, JQ ;
Zou, KH ;
Maier, SE ;
Tempany, CMC .
MEDICAL PHYSICS, 2003, 30 (09) :2390-2398
[10]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851