TransConver: transformer and convolution parallel network for developing automatic brain tumor segmentation in MRI images

被引:48
作者
Liang, Junjie [1 ]
Yang, Cihui [1 ]
Zeng, Mengjie [1 ]
Wang, Xixi [1 ]
机构
[1] Nanchang Hangkong Univ, Sch Informat Engn, 696 Fenghenan Rd, Nanchang 330063, Jiangxi, Peoples R China
关键词
Brain tumor segmentation; transformer; convolution; cross-attention; local and global semantic information;
D O I
10.21037/qims-21-919
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Medical image segmentation plays a vital role in computer-aided diagnosis (CAD) systems. Both convolutional neural networks (CNNs) with strong local information extraction capacities and transformers with excellent global representation capacities have achieved remarkable performance in medical image segmentation. However, because of the semantic differences between local and global features, how to combine convolution and transformers effectively is an important challenge in medical image segmentation. Methods: In this paper, we proposed TransConver, a U-shaped segmentation network based on convolution and transformer for automatic and accurate brain tumor segmentation in MRI images. Unlike the recently proposed transformer and convolution based models, we proposed a parallel module named transformer-convolution inception (TC-inception), which extracts local and global information via convolution blocks and transformer blocks, respectively, and integrates them by a cross-attention fusion with global and local feature (CAFGL) mechanism. Meanwhile, the improved skip connection structure named skip connection with cross-attention fusion (SCCAF) mechanism can alleviate the semantic differences between encoder features and decoder features for better feature fusion. In addition, we designed 2D-TransConver and 3D-TransConver for 2D and 3D brain tumor segmentation tasks, respectively, and verified the performance and advantage of our model through brain tumor datasets. Results: We trained our model on 335 cases from the training dataset of MICCAI BraTS2019 and evaluated the model's performance based on 66 cases from MICCAI BraTS2018 and 125 cases from MICCAI BraTS2019. Our TransConver achieved the best average Dice score of 83.72% and 86.32% on BraTS2019 and BraTS2018, respectively. Conclusions: We proposed a transformer and convolution parallel network named TransConver for brain tumor segmentation. The TC-Inception module effectively extracts global information while retaining local details. The experimental results demonstrated that good segmentation requires the model to extract local fine-grained details and global semantic information simultaneously, and our TransConver effectively improves the accuracy of brain tumor segmentation.
引用
收藏
页码:2397 / 2415
页数:19
相关论文
共 44 条
[11]  
Dosovitskiy A, 2020, ARXIV
[12]   Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering [J].
Gao, Peng ;
Jiang, Zhengkai ;
You, Haoxuan ;
Lu, Pan ;
Hoi, Steven ;
Wang, Xiaogang ;
Li, Hongsheng .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6632-6641
[13]  
Glorot X., 2011, P 14 INT C ART INT S, V15, P315
[14]   Conformer: Convolution-augmented Transformer for Speech Recognition [J].
Gulati, Anmol ;
Qin, James ;
Chiu, Chung-Cheng ;
Parmar, Niki ;
Zhang, Yu ;
Yu, Jiahui ;
Han, Wei ;
Wang, Shibo ;
Zhang, Zhengdong ;
Wu, Yonghui ;
Pang, Ruoming .
INTERSPEECH 2020, 2020, :5036-5040
[15]  
Guo M-H, 2021, ARXIV PREPRINT ARXIV
[16]  
Hatamizadeh A, 2021, ARXIV PREPRINT ARXIV
[17]   COMPARING IMAGES USING THE HAUSDORFF DISTANCE [J].
HUTTENLOCHER, DP ;
KLANDERMAN, GA ;
RUCKLIDGE, WJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (09) :850-863
[18]   Two-Stage Cascaded U-Net: 1st Place Solution to BraTS Challenge 2019 Segmentation Task [J].
Jiang, Zeyu ;
Ding, Changxing ;
Liu, Minfeng ;
Tao, Dacheng .
BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES (BRAINLES 2019), PT I, 2020, 11992 :231-241
[19]   Automatic segmentation of the left ventricle in echocardiographic images using convolutional neural networks [J].
Kim, Taeouk ;
Hedayat, Mohammadali ;
Vaitkus, Veronica V. ;
Belohlavek, Marek ;
Krishnamurthy, Vinayak ;
Borazjani, Iman .
QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2021, 11 (05) :1763-1781
[20]   Structure-aware Interactive Graph Neural Networks for the Prediction of Protein-Ligand Binding Affinity [J].
Li, Shuangli ;
Zhou, Jingbo ;
Xu, Tong ;
Huang, Liang ;
Wang, Fan ;
Xiong, Haoyi ;
Huang, Weili ;
Dou, Dejing ;
Xiong, Hui .
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, :975-985