Atten-Nonlocal Unet: Attention and Non-local Unet for medical image segmentation

被引:0
作者
Jia, Xiaofen [1 ]
Wang, Wenjie [2 ]
Zhang, Mei [3 ]
Zhao, Baiting [2 ]
机构
[1] School of Artificial Intelligence, Anhui University of Science and Technology, Huainan
[2] School of Electrical and Information Engineering, Anhui University of Science and Technology, Huainan
[3] Sleep Medicine Center in High-tech District Hospital and Department of Neurology, First Affiliated Hospital of Anhui University of Science and Technology, First People's Hospital of Huainan, Huainan
基金
中国国家自然科学基金;
关键词
Attention mechanism; Convolutional neural network; Long-range information; Medical image segmentation; Transformer;
D O I
10.1016/j.compbiomed.2025.110129
中图分类号
学科分类号
摘要
The convolutional neural network(CNN)-based models have emerged as the predominant approach for medical image segmentation due to their effective inductive bias. However, their limitation lies in the lack of long-range information. In this study, we propose the Atten-Nonlocal Unet model that integrates CNN and transformer to overcome this limitation and precisely capture global context in 2D features. Specifically, we utilize the BCSM attention module and the Cross Non-local module to enhance feature representation, thereby improving the segmentation accuracy. Experimental results on the Synapse, ACDC, and AVT datasets show that Atten-Nonlocal Unet achieves DSC scores of 84.15%, 91.57%, and 86.94% respectively, and has 95% HD of 15.17, 1.16, and 4.78 correspondingly. Compared to the existing methods for medical image segmentation, the proposed method demonstrates superior segmentation performance, ensuring high accuracy in segmenting large organs while improving segmentation for small organs. © 2025 Elsevier Ltd
引用
收藏
相关论文
共 39 条
[1]  
Wang R., Lei T., Cui R., Zhang B., Meng H., Nandi A.K., Medical image segmentation using deep learning: A survey, IET Image Process., 16, 5, pp. 1243-1267, (2022)
[2]  
Shamshad F., Khan S., Zamir S.W., Khan M.H., Hayat M., Khan F.S., Fu H., Transformers in medical imaging: A survey, Med. Image Anal., (2023)
[3]  
Ronneberger O., Fischer P., Brox T., U-net: Convolutional networks for biomedical image segmentation, Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18, pp. 234-241, (2015)
[4]  
Howard A.G., Zhu M., Chen B., Kalenichenko D., Wang W., Weyand T., Andreetto M., Adam H., Mobilenets: Efficient convolutional neural networks for mobile vision applications, (2017)
[5]  
Dai J., Qi H., Xiong Y., Li Y., Zhang G., Hu H., Wei Y., Deformable convolutional networks, pp. 764-773, (2017)
[6]  
Ye Y., Zhang J., Chen Z., Xia Y., CADS: A self-supervised learner via cross-modal alignment and deep self-distillation for CT volume segmentation, IEEE Trans. Med. Imaging, 44, 1, pp. 118-129, (2025)
[7]  
Simonyan K., Zisserman A., Very deep convolutional networks for large-scale image recognition, (2014)
[8]  
He K., Zhang X., Ren S., Sun J., Deep residual learning for image recognition, pp. 770-778, (2016)
[9]  
Zhu Z., He X., Qi G., Li Y., Cong B., Liu Y., Brain tumor segmentation based on the fusion of deep semantics and edge information in multimodal MRI, Inf. Fusion, 91, pp. 376-387, (2023)
[10]  
Hu J., Shen L., Sun G., pp. 7132-7141, (2018)