Contextual embedding learning to enhance 2D networks for volumetric image segmentation

被引:0
作者
Wang, Zhuoyuan [1 ,2 ,3 ]
Sun, Dong [4 ]
Zeng, Xiangyun [1 ,2 ,3 ]
Wu, Ruodai [5 ]
Wang, Yi [1 ,2 ,3 ]
机构
[1] Shenzhen Univ, Sch Biomed Engn, Natl Reg Key Technol Engn Lab Med Ultrasound, Guangdong Key Lab Biomed Measurements & Ultrasound, Shenzhen, Peoples R China
[2] Smart Med Imaging Learning & Engn SMILE Lab, Shenzhen, Peoples R China
[3] Med UltraSound Image Comp MUS Lab, Shenzhen, Peoples R China
[4] Huawei Cloud Comp Technol Co Ltd, Shenzhen, Peoples R China
[5] Shenzhen Univ, Dept Radiol, Gen Hosp, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; Convolutional neural networks; Embedding learning; Contextual information; Attention mechanism; PROSTATE SEGMENTATION; ARCHITECTURE; MRI;
D O I
10.1016/j.eswa.2024.124279
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The segmentation of organs in volumetric medical images plays an important role in computer-aided diagnosis and treatment/surgery planning. Conventional 2D convolutional neural networks (CNNs) can hardly exploit the spatial correlation of volumetric data. Current 3D CNNs have the advantage to extract more powerful volumetric representations but they usually suffer from occupying excessive memory and computation nevertheless. In this study we aim to enhance the 2D networks with contextual information for better volumetric image segmentation. Accordingly, we propose a contextual embedding learning approach to facilitate 2D CNNs capturing spatial information properly. Our approach leverages the learned embedding and the slicewisely neighboring matching as a soft cue to guide the network. In such a way, the contextual information can be transferred slice-by-slice thus boosting the volumetric representation of the network. Experiments on challenging prostate MRI dataset (PROMISE12) and abdominal CT dataset (CHAOS) show that our contextual embedding learning can effectively leverage the inter-slice context and improve segmentation performance. The proposed approach is a plug-and-play, and memory-efficient solution to enhance the 2D networks for volumetric segmentation. Our code is publicly available at https://github.com/JuliusWang-7/CE_Block.
引用
收藏
页数:10
相关论文
共 54 条
[1]   Dual Cross-Attention for medical image segmentation [J].
Ates, Gorkem Can ;
Mohan, Prasoon ;
Celik, Emrah .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
[2]   VoxResNet: Deep voxelwise residual networks for brain segmentation from 3D MR images [J].
Chen, Hao ;
Dou, Qi ;
Yu, Lequan ;
Qin, Jing ;
Heng, Pheng-Ann .
NEUROIMAGE, 2018, 170 :446-455
[3]   DCAN: Deep Contour-Aware Networks for Accurate Gland Segmentation [J].
Chen, Hao ;
Qi, Xiaojuan ;
Yu, Lequan ;
Heng, Pheng-Ann .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2487-2496
[4]  
Chen Jianxu, 2016, Advances in Neural Information Processing Systems, V29
[5]  
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, 10.48550/arXiv.1706.05587]
[6]   Blazingly Fast Video Object Segmentation with Pixel-Wise Metric Learning [J].
Chen, Yuhua ;
Pont-Tuset, Jordi ;
Montes, Alberto ;
Van Gool, Luc .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1189-1198
[7]  
Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49
[8]   FlowNet: Learning Optical Flow with Convolutional Networks [J].
Dosovitskiy, Alexey ;
Fischer, Philipp ;
Ilg, Eddy ;
Haeusser, Philip ;
Hazirbas, Caner ;
Golkov, Vladimir ;
van der Smagt, Patrick ;
Cremers, Daniel ;
Brox, Thomas .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2758-2766
[9]  
Fathi A, 2017, Arxiv, DOI arXiv:1703.10277
[10]   UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation [J].
Gao, Yunhe ;
Zhou, Mu ;
Metaxas, Dimitris N. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 :61-71