TSCA-Net: Transformer based spatial-channel attention segmentation network for medical images

Cited by: 38
Authors
Fu, Yinghua [1]
Liu, Junfeng [1]
Shi, Jun [2]
Affiliations
[1] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
[2] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
Keywords
Medical image segmentation; Spatial and channel attention; Transformer; Feature fusion; U-NET; ARCHITECTURE
DOI
10.1016/j.compbiomed.2024.107938
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Deep learning architectures based on convolutional neural networks (CNNs) and Transformers have achieved great success in medical image segmentation. Models based on the encoder-decoder framework, such as U-Net, have been successfully employed in many real-world scenarios. However, because medical images exhibit low contrast between object and background, objects of varying shapes and scales, and complex backgrounds, it is difficult to locate targets and to extract the effective information needed for better segmentation performance. In this paper, an encoder-decoder architecture based on Transformer-built spatial and channel attention modules is proposed for medical image segmentation. Concretely, Transformer-based spatial and channel attention modules are used to extract complementary global spatial and channel information at different layers of a U-shaped network, which helps the network learn detailed features at different scales. To better fuse the spatial and channel information from the Transformer features, a spatial and channel feature fusion block is designed for the decoder. The proposed network inherits the advantages of both CNNs and Transformers, combining local feature representation with long-range dependency modeling for medical images. Qualitative and quantitative experiments demonstrate that the proposed method outperforms eight state-of-the-art segmentation methods on five publicly available medical image datasets covering different modalities, for example achieving Dice values of 80.23% and 93.56% and Intersection over Union (IoU) values of 67.13% and 88.94% on the Multi-organ Nucleus Segmentation (MoNuSeg) and Combined Healthy Abdominal Organ Segmentation with Computed Tomography scans (CHAOS-CT) datasets, respectively.
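The abstract reports results as Dice and IoU values. As a reading aid, the sketch below shows how these two overlap metrics are commonly computed for a single pair of binary masks. It is not the authors' evaluation code: the function name `dice_and_iou`, the flat 0/1 mask representation, the toy masks, and the convention for two empty masks are all assumptions made for illustration.

```python
def dice_and_iou(pred, target):
    """Return (dice, iou) for two binary masks given as equal-length flat sequences of 0/1.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")
    # Pixels marked foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - intersection
    if union == 0:
        # Both masks empty: treated as perfect agreement (an assumed convention).
        return 1.0, 1.0
    dice = 2.0 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


# Toy 3x3 masks flattened row by row (made-up values, not data from the paper).
pred = [1, 1, 0,
        1, 1, 0,
        0, 0, 0]
target = [0, 1, 1,
          0, 1, 1,
          0, 0, 0]
dice, iou = dice_and_iou(pred, target)
print(f"Dice = {dice:.4f}, IoU = {iou:.4f}")
```

For a single mask pair the two metrics are tied by Dice = 2·IoU / (1 + IoU). Scores averaged over a dataset, such as those quoted in the abstract, need not satisfy this identity exactly, since the mean is taken per image before comparison.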
Pages: 16