MM-UNet: A novel cross-attention mechanism between modules and scales for brain tumor segmentation

被引:2
作者
Lin, Chih-Wei [1 ]
Chen, Zhongsheng [1 ]
机构
[1] Fujian Agr & Forestry Univ, Coll Comp & Informat Sci, Fuzhou 350002, Fujian, Peoples R China
关键词
Multi-scale fusion; Non-local attention; Cross-attention; Brain tumor segmentation; Multi-modules; U-NET; ARCHITECTURE; TRANSFORMER;
D O I
10.1016/j.engappai.2024.108591
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Brain tumor segmentation is an essential issue in medical image segmentation. However, it is still challenging to consider the relationship between modules and efficiently fuse the features between adjacent scales. In this paper, we propose a novel cross-attention network for brain tumor segmentation, namely multi-scales and multi-modules cross-attention UNet (MM-UNet), which contains two mechanisms, module and scale cross- attentions. The module cross-attention (MCA) MCA ) strategy connects and exchanges global information between adjacent modules. The scale cross-attention (SCA) SCA ) strategy has two policies, the scale-related non-local relationship module ( SCA SNR ) and the scale-related channel-based relationship module ( SCA SCR ), that fuses the information between adjacent scales to mix the multi-scale information. Experiments on well-known tumor datasets, BraTS 2020, which has 369 cases, and has been classified into training, validation, and testing sets with 17,576, 4395, and 5735 images, to evaluate the performance by segmenting three regions, the whole tumor area (WT), core tumor area (CT) and enhancing tumor area (ET). Moreover, we consider three numerical metrics, dice, precision, and Hausdorff metrics, and various visualization results to objectively evaluate and intuitively display the experimental results. The proposed model surpasses state-of-the-art methods and achieves 0.8519, 0.8889, and 1.2647 with a base version network in dice, precision, sensitivity, and Hausdorff metrics, respectively. Moreover, we demonstrate the visualization with segmentation results and heatmaps in various scenarios to present the robustness of the proposed network in each region.
引用
收藏
页数:20
相关论文
共 52 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]  
Bakas S, 2019, Arxiv, DOI arXiv:1811.02629
[3]   Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features [J].
Bakas, Spyridon ;
Akbari, Hamed ;
Sotiras, Aristeidis ;
Bilello, Michel ;
Rozycki, Martin ;
Kirby, Justin S. ;
Freymann, John B. ;
Farahani, Keyvan ;
Davatzikos, Christos .
SCIENTIFIC DATA, 2017, 4
[4]   A survey of MRI-based medical image analysis for brain tumor studies [J].
Bauer, Stefan ;
Wiest, Roland ;
Nolte, Lutz-P ;
Reyes, Mauricio .
PHYSICS IN MEDICINE AND BIOLOGY, 2013, 58 (13) :R97-R129
[5]  
Cao Hu, 2023, Computer Vision - ECCV 2022 Workshops: Proceedings. Lecture Notes in Computer Science (13803), P205, DOI 10.1007/978-3-031-25066-8_9
[6]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[7]  
Chen J., 2021, arXiv
[8]  
Dosovitskiy A., 2021, 9 INT C LEARN REPR I
[9]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149
[10]   UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation [J].
Gao, Yunhe ;
Zhou, Mu ;
Metaxas, Dimitris N. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 :61-71