CSU-Net: A CNN-Transformer Parallel Network for Multimodal Brain Tumour Segmentation

被引:17
作者
Chen, Yu [1 ]
Yin, Ming [2 ]
Li, Yu [2 ]
Cai, Qian [2 ]
机构
[1] Hubei Univ Educ, Coll Comp, Wuhan 430205, Peoples R China
[2] Wuhan Text Univ, Sch Elect & Elect Engn, Wuhan 430200, Peoples R China
关键词
brain tumour segmentation; multimodal MRI; CNN; volumetric transformer;
D O I
10.3390/electronics11142226
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical image segmentation techniques are vital to medical image processing and analysis. Considering the significant clinical applications of brain tumour image segmentation, it represents a focal point of medical image segmentation research. Most of the work in recent times has been centred on Convolutional Neural Networks (CNN) and Transformers. However, CNN has some deficiencies in modelling long-distance information transfer and contextual processing information, while Transformer is relatively weak in acquiring local information. To overcome the above defects, we propose a novel segmentation network with an "encoder-decoder" architecture, namely CSU-Net. The encoder consists of two parallel feature extraction branches based on CNN and Transformer, respectively, in which the features of the same size are fused. The decoder has a dual Swin Transformer decoder block with two learnable parameters for feature upsampling. The features from multiple resolutions in the encoder and decoder are merged via skip connections. On the BraTS 2020, our model achieves 0.8927, 0.8857, and 0.8188 for the Whole Tumour (WT), Tumour Core (TC), and Enhancing Tumour (ET), respectively, in terms of Dice scores.
引用
收藏
页数:12
相关论文
共 31 条
[1]  
Bakas S, 2019, Arxiv, DOI arXiv:1811.02629
[2]   Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features [J].
Bakas, Spyridon ;
Akbari, Hamed ;
Sotiras, Aristeidis ;
Bilello, Michel ;
Rozycki, Martin ;
Kirby, Justin S. ;
Freymann, John B. ;
Farahani, Keyvan ;
Davatzikos, Christos .
SCIENTIFIC DATA, 2017, 4
[3]  
Cao H., 2021, arXiv
[4]  
Chen J., 2021, arXiv, DOI 10.48550/arXiv:2102.04306
[5]  
Chen L, 2015, P INT C LEARNING REP
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]  
Cicek Ozgun, 2016, Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016. 19th International Conference. Proceedings: LNCS 9901, P424, DOI 10.1007/978-3-319-46723-8_49
[8]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[9]   Image Denoising Using a Compressive Sensing Approach Based on Regularization Constraints [J].
El Mahdaoui, Assia ;
Ouahabi, Abdeldjalil ;
Moulay, Mohamed Said .
SENSORS, 2022, 22 (06)
[10]  
Ghaffari M., 2020, Revised Selected Papers, P481