RTAU-Net: A novel 3D rectal tumor segmentation model based on dual path fusion and attentional guidance

被引:5
作者
Li, Dengao [1 ,3 ,4 ]
Wang, Juan [1 ,3 ,4 ]
Yang, Jicheng [5 ]
Zhao, Jumin [2 ,3 ,4 ]
Yang, Xiaotang [6 ]
Cui, Yanfen [6 ]
Zhang, Kenan [1 ,3 ,4 ]
机构
[1] Taiyuan Univ Technol, Coll Data Sci, Taiyuan 030024, Peoples R China
[2] Taiyuan Univ Technol, Coll Informat & Comp, Taiyuan 030024, Peoples R China
[3] Taiyuan Univ Technol, Key Lab Big Data Fus Anal & Applicat Shanxi Prov, Taiyuan, Shanxi, Peoples R China
[4] Taiyuan Univ Technol, Intelligent Percept Engn Technol Ctr Shanxi, Taiyuan, Shanxi, Peoples R China
[5] Ocean Univ China, Comp Technol, Qingdao 266100, Peoples R China
[6] Shanxi Med Univ, Shanxi Prov Canc Hosp, Dept Radiol, Taiyuan 030013, Peoples R China
关键词
3D rectal tumor segmentation; Magnetic resonance image; Deep learning; Intelligent diagnosis; Channel attention; Transformer; CANCER;
D O I
10.1016/j.cmpb.2023.107842
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and objective: According to the Global Cancer Statistics 2020, colorectal cancer has the third-highest diagnosis rate (10.0 %) and the second-highest mortality rate (9.4 %) among the 36 types. Rectal cancer accounts for a large proportion of colorectal cancer. The size and shape of the rectal tumor can directly affect the diagnosis and treatment by doctors. The existing rectal tumor segmentation methods are based on two-dimensional slices, which cannot analyze a patient's tumor as a whole and lose the correlation between slices of MRI image, so the practical application value is not high. Methods: In this paper, a three-dimensional rectal tumor segmentation model is proposed. Firstly, image preprocessing is performed to reduce the effect caused by the unbalanced proportion of background region and target region, and improve the quality of the image. Secondly, a dual-path fusion network is designed to extract both global features and local detail features of rectal tumors. The network includes two encoders, a residual encoder for enhancing the spatial detail information and feature representation of the tumor and a transformer encoder for extracting global contour information of the tumor. In the decoding stage, we merge the information extracted from the dual paths and decode them. In addition, for the problem of the complex morphology and different sizes of rectal tumors, a multi-scale fusion channel attention mechanism is designed, which can capture important contextual information of different scales. Finally, visualize the 3D rectal tumor segmentation results. Results: The RTAU-Net is evaluated on the data set provided by Shanxi Provincial Cancer Hospital and Xinhua Hospital. The experimental results showed that the Dice of tumor segmentation reached 0.7978 and 0.6792, respectively, which improved by 2.78 % and 7.02 % compared with suboptimal model. Conclusions: Although the morphology of rectal tumors varies, RTAU-Net can precisely localize rectal tumors and learn the contour and details of tumors, which can relieve physicians' workload and improve diagnostic accuracy.
引用
收藏
页数:16
相关论文
共 45 条
[1]   Efficient 3D Deep Learning Model for Medical Image Semantic Segmentation [J].
Alalwan, Nasser ;
Abozeid, Amr ;
ElHabshy, AbdAllah A. ;
Alzahrani, Ahmed .
ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (01) :1231-1239
[2]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[3]   Multiresolution Aggregation Transformer UNet Based on Multiscale Input and Coordinate Attention for Medical Image Segmentation [J].
Chen, Shaolong ;
Qiu, Changzhen ;
Yang, Weiping ;
Zhang, Zhiyong .
SENSORS, 2022, 22 (10)
[4]  
Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49
[5]  
Dosovitskiy A., 2021, arXiv
[6]   MTSE U-Net: an architecture for segmentation, and prediction of fetal brain and gestational age from MRI of brain [J].
Gangopadhyay, Tuhinangshu ;
Halder, Shinjini ;
Dasgupta, Paramik ;
Chatterjee, Kingshuk ;
Ganguly, Debayan ;
Sarkar, Surjadeep ;
Roy, Sudipta .
NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2022, 11 (01)
[7]  
Halder S., 2023, Data Management, Analytics and Innovation, P367, DOI 10.1007/978-981-99-1414-2_28
[8]   UNETR: Transformers for 3D Medical Image Segmentation [J].
Hatamizadeh, Ali ;
Tang, Yucheng ;
Nath, Vishwesh ;
Yang, Dong ;
Myronenko, Andriy ;
Landman, Bennett ;
Roth, Holger R. ;
Xu, Daguang .
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :1748-1758
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]