ISC-TRANSUNET: MEDICAL IMAGE SEGMENTATION NETWORK BASED ON THE INTEGRATION OF SELF-ATTENTION AND CONVOLUTION

被引:1
作者
Li, Fang [1 ]
Pei, Siyu [1 ]
Zhang, Ziqun [2 ]
Yang, Fuming [3 ]
机构
[1] Shanghai Univ Elect Power, Sch Comp Sci & Technol, Shanghai 201306, Peoples R China
[2] Fudan Univ, Informat Off, Shanghai 200433, Peoples R China
[3] Shanghai Gen Hosp, Dept Neurosurg, Shanghai 200025, Peoples R China
关键词
Medical image segmentation; transformer; self-attention; feature fusion; lightweight;
D O I
10.1142/S0219519423401073
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
In the current medical image segmentation network, the combination of CNN and Transformer has become a mainstream trend. However, the inherent limitations of convolution operation in CNN and insufficient information interaction in Transformer affect the segmentation performance of the network. To solve these problems, an integrated self-attention and convolution medical image segmentation network (ISC-TransUNet) is proposed in this paper. The network consists of encoder, decoder and jump connection. First, the encoder uses a hybrid structure of BoTNet and Transformer to capture more comprehensive image information and reduce additional computing overhead. Then, the decoder uses an upper sampler cascaded by multiple DUpsampling upper blocks to accurately recover the pixel-level prediction. Finally, the feature fusion of encoder and decoder at different resolutions is realized by ResPath jump connection, which reduces the semantic difference between encoder and decoder. Through experiments on the Synapse multi-organ segmentation dataset, compared with the baseline model TransUNet, Dice similarity coefficient of ISC-TransUNet was improved by 1.13%, Hausdorff distance was reduced by 2.38%, and weight was maintained. The experimental results show that the network can effectively segment tissues and organs in medical images, which has important theoretical significance and application value for intelligent clinical diagnosis and treatment.
引用
收藏
页数:13
相关论文
共 20 条
[1]  
Chang Y, 2021, ARXIV
[2]  
Chen J., ARXIV
[3]  
Dosovitskiy A., 2021, P INT C LEARN REPR, DOI DOI 10.48550/ARXIV.2010.11929
[4]   Fully Dense UNet for 2-D Sparse Photoacoustic Tomography Artifact Removal [J].
Guan, Steven ;
Khan, Amir A. ;
Sikdar, Siddhartha ;
Chitnis, Parag V. .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (02) :568-576
[5]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[6]   Deep Learning Techniques for Medical Image Segmentation: Achievements and Challenges [J].
Hesamian, Mohammad Hesam ;
Jia, Wenjing ;
He, Xiangjian ;
Kennedy, Paul .
JOURNAL OF DIGITAL IMAGING, 2019, 32 (04) :582-596
[7]  
Huang HM, 2020, INT CONF ACOUST SPEE, P1055, DOI [10.1109/ICASSP40776.2020.9053405, 10.1109/icassp40776.2020.9053405]
[8]   A Review of Deep-Learning-Based Medical Image Segmentation Methods [J].
Liu, Xiangbin ;
Song, Liping ;
Liu, Shuai ;
Zhang, Yudong .
SUSTAINABILITY, 2021, 13 (03) :1-29
[9]  
Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
[10]   V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation [J].
Milletari, Fausto ;
Navab, Nassir ;
Ahmadi, Seyed-Ahmad .
PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :565-571