TransConver: transformer and convolution parallel network for developing automatic brain tumor segmentation in MRI images

被引:48
作者
Liang, Junjie [1 ]
Yang, Cihui [1 ]
Zeng, Mengjie [1 ]
Wang, Xixi [1 ]
机构
[1] Nanchang Hangkong Univ, Sch Informat Engn, 696 Fenghenan Rd, Nanchang 330063, Jiangxi, Peoples R China
关键词
Brain tumor segmentation; transformer; convolution; cross-attention; local and global semantic information;
D O I
10.21037/qims-21-919
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Medical image segmentation plays a vital role in computer-aided diagnosis (CAD) systems. Both convolutional neural networks (CNNs) with strong local information extraction capacities and transformers with excellent global representation capacities have achieved remarkable performance in medical image segmentation. However, because of the semantic differences between local and global features, how to combine convolution and transformers effectively is an important challenge in medical image segmentation. Methods: In this paper, we proposed TransConver, a U-shaped segmentation network based on convolution and transformer for automatic and accurate brain tumor segmentation in MRI images. Unlike the recently proposed transformer and convolution based models, we proposed a parallel module named transformer-convolution inception (TC-inception), which extracts local and global information via convolution blocks and transformer blocks, respectively, and integrates them by a cross-attention fusion with global and local feature (CAFGL) mechanism. Meanwhile, the improved skip connection structure named skip connection with cross-attention fusion (SCCAF) mechanism can alleviate the semantic differences between encoder features and decoder features for better feature fusion. In addition, we designed 2D-TransConver and 3D-TransConver for 2D and 3D brain tumor segmentation tasks, respectively, and verified the performance and advantage of our model through brain tumor datasets. Results: We trained our model on 335 cases from the training dataset of MICCAI BraTS2019 and evaluated the model's performance based on 66 cases from MICCAI BraTS2018 and 125 cases from MICCAI BraTS2019. Our TransConver achieved the best average Dice score of 83.72% and 86.32% on BraTS2019 and BraTS2018, respectively. Conclusions: We proposed a transformer and convolution parallel network named TransConver for brain tumor segmentation. The TC-Inception module effectively extracts global information while retaining local details. The experimental results demonstrated that good segmentation requires the model to extract local fine-grained details and global semantic information simultaneously, and our TransConver effectively improves the accuracy of brain tumor segmentation.
引用
收藏
页码:2397 / 2415
页数:19
相关论文
共 44 条
[1]   A Spatial Guided Self-supervised Clustering Network for Medical Image Segmentation [J].
Ahn, Euijoon ;
Feng, Dagan ;
Kim, Jinman .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 :379-388
[2]  
[Anonymous], 2018, Medical Imaging with Deep Learning
[3]  
Bakas S., 2018, ARXIV PREPRINT ARXIV
[4]   Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features [J].
Bakas, Spyridon ;
Akbari, Hamed ;
Sotiras, Aristeidis ;
Bilello, Michel ;
Rozycki, Martin ;
Kirby, Justin S. ;
Freymann, John B. ;
Farahani, Keyvan ;
Davatzikos, Christos .
SCIENTIFIC DATA, 2017, 4
[5]  
Brox T., 2016, INT C MED IM COMP CO, P424
[6]   Dense-UNet: a novel multiphoton in vivo cellular image segmentation model based on a convolutional neural network [J].
Cai, Sijing ;
Tian, Yunxian ;
Lui, Harvey ;
Zeng, Haishan ;
Wu, Yi ;
Chen, Guannan .
QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2020, 10 (06) :1275-1285
[7]   Learning Delicate Local Representations for Multi-person Pose Estimation [J].
Cai, Yuanhao ;
Wang, Zhicheng ;
Luo, Zhengxiong ;
Yin, Binyi ;
Du, Angang ;
Wang, Haoqian ;
Zhang, Xiangyu ;
Zhou, Xinyu ;
Zhou, Erjin ;
Sun, Jian .
COMPUTER VISION - ECCV 2020, PT III, 2020, 12348 :455-472
[8]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[9]  
Chen J., 2021, arXiv preprint arXiv:210204306, P04306
[10]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848