TSCA-Net: Transformer based spatial-channel attention segmentation network for medical images

Cited by: 38
Authors
Fu, Yinghua [1]
Liu, Junfeng [1]
Shi, Jun [2]
Affiliations
[1] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
[2] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
Keywords
Medical image segmentation; Spatial and channel attention; Transformer; Feature fusion; U-NET; ARCHITECTURE
DOI
10.1016/j.compbiomed.2024.107938
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Deep learning architectures based on convolutional neural networks (CNNs) and Transformers have achieved great success in medical image segmentation. Models based on the encoder-decoder framework, such as U-Net, have been successfully employed in many real-world scenarios. However, medical images have low contrast between object and background, objects of varied shapes and scales, and complex backgrounds, which makes it difficult to extract effective information, locate targets, and improve segmentation performance. In this paper, an encoder-decoder architecture based on spatial and channel attention modules built from Transformers is proposed for medical image segmentation. Concretely, Transformer-based spatial and channel attention modules extract complementary global spatial and channel information at different layers of a U-shaped network, which helps the network learn detailed features at different scales. To better fuse the spatial and channel information in the Transformer features, a spatial and channel feature fusion block is designed for the decoder. The proposed network inherits the advantages of both CNNs and Transformers, combining local feature representation with long-range dependency modeling for medical images. Qualitative and quantitative experiments demonstrate that the proposed method outperforms eight state-of-the-art segmentation methods on five publicly available medical image datasets covering different modalities. For example, it achieves Dice values of 80.23% and 93.56% and Intersection over Union (IoU) values of 67.13% and 88.94% on the Multi-organ Nucleus Segmentation (MoNuSeg) and Combined Healthy Abdominal Organ Segmentation with Computed Tomography scans (CHAOS-CT) datasets, respectively.
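The abstract reports results as Dice and IoU values. As a reading aid, the following is a minimal sketch of how these two overlap metrics are commonly computed for a single pair of binary masks. It is not the authors' evaluation code: the function name `dice_and_iou`, the nested-list mask format, and the convention for two empty masks are assumptions made for illustration.

```python
def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two binary masks given as nested lists of 0/1.

    Hypothetical helper for illustration; not taken from the paper.
    """
    flat_pred = [bool(v) for row in pred for v in row]
    flat_target = [bool(v) for row in target for v in row]
    if len(flat_pred) != len(flat_target):
        raise ValueError("masks must have the same number of pixels")

    # Pixels that are foreground in both masks.
    intersection = sum(p and t for p, t in zip(flat_pred, flat_target))
    pred_area = sum(flat_pred)
    target_area = sum(flat_target)
    union = pred_area + target_area - intersection

    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0, 1.0

    dice = 2 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


# Toy 2x3 masks: 3 predicted pixels, 3 target pixels, 2 shared.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
dice, iou = dice_and_iou(pred, target)
print(f"Dice = {dice:.4f}, IoU = {iou:.4f}")
```

For a single mask pair the two metrics are tied by Dice = 2·IoU / (1 + IoU), so Dice is never lower than IoU. Scores averaged over many images, as dataset-level results usually are, need not satisfy this identity exactly.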
Pages: 16