TransSea: Hybrid CNN-Transformer With Semantic Awareness for 3-D Brain Tumor Segmentation

被引:8
|
作者
Liu, Yu [1 ,2 ]
Ma, Yize [1 ,2 ]
Zhu, Zhiqin [3 ]
Cheng, Juan [1 ,2 ]
Chen, Xun [4 ]
机构
[1] Hefei Univ Technol, Dept Biomed Engn, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Anhui Prov Key Lab Measuring Theory & Precis Instr, Hefei 230009, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Coll Automat, Chongqing 400065, Peoples R China
[4] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Peoples R China
基金
中国国家自然科学基金;
关键词
Brain tumor segmentation; convolutional neural networks (CNNs); multimodal magnetic resonance imaging (MRI); semantic guidance (SG); Transformer; U-NET; ATTENTION;
D O I
10.1109/TIM.2024.3413130
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurate segmentation of brain tumors in multimodal magnetic resonance imaging (MRI) plays a crucial role in clinical quantitative assessments, diagnostic processes, and the planning of therapeutic strategies. Both convolutional neural networks (CNNs) with strong local information extraction capacities and Transformers with excellent global representation capacities have achieved remarkable performance in medical image segmentation. However, considering the inherent semantic disparities between local and global features, effectively combining convolutions and Transformers presents a significant challenge in medical image segmentation. To address this issue, through integrating the merits of these two paradigms in a well-designed encoder-decoder architecture, we propose a hybrid CNN-Transformer network with semantic awareness, named TransSea, for an accurate 3-D brain tumor segmentation task. Our network incorporates a semantic mutual attention (SMA) module at the encoding stage, seamlessly integrating global and local features. Furthermore, our design includes a multiscale semantic guidance (SG) module that introduces semantic priors in the encoder through semantic supervision, enabling focused segmentation in relevant areas. In the decoding process, a semantic integration (SI) module is presented to further integrate various feature mappings from the encoder and semantic priors, thereby enhancing the propagation of semantic information and achieving semantically aware querying. Extensive experiments on two brain tumor datasets, BraTS2020 and BraTS2021, demonstrate that our model significantly outperforms existing state-of-the-art methods. The source code of the proposed method will be made available at https://github.com/yuliu316316.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Multi-scale Masked 3-D U-Net for Brain Tumor Segmentation
    Xu, Yanwu
    Gong, Mingming
    Fu, Huan
    Tao, Dacheng
    Zhang, Kun
    Batmanghelich, Kayhan
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2018, PT II, 2019, 11384 : 222 - 233
  • [32] HCT-net: hybrid CNN-transformer model based on a neural architecture search network for medical image segmentation
    Yu, Zhihong
    Lee, Feifei
    Chen, Qiu
    APPLIED INTELLIGENCE, 2023, 53 (17) : 19990 - 20006
  • [33] HCT-net: hybrid CNN-transformer model based on a neural architecture search network for medical image segmentation
    Zhihong Yu
    Feifei Lee
    Qiu Chen
    Applied Intelligence, 2023, 53 : 19990 - 20006
  • [34] Hybrid Window Attention Based Transformer Architecture for Brain Tumor Segmentation
    Peiris, Himashi
    Hayat, Munawar
    Chen, Zhaolin
    Egan, Gary
    Harandi, Mehrtash
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2022, PT II, 2023, 14092 : 173 - 182
  • [35] TransDoubleU-Net: Dual Scale Swin Transformer With Dual Level Decoder for 3D Multimodal Brain Tumor Segmentation
    Vatanpour, Marjan
    Haddadnia, Javad
    IEEE ACCESS, 2023, 11 : 125511 - 125518
  • [36] Medical Transformer: Universal Encoder for 3-D Brain MRI Analysis
    Jun, Eunji
    Jeong, Seungwoo
    Heo, Da-Woon
    Suk, Heung-Il
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 35 (12) : 1 - 11
  • [37] SwinBTS: A Method for 3D Multimodal Brain Tumor Segmentation Using Swin Transformer
    Jiang, Yun
    Zhang, Yuan
    Lin, Xin
    Dong, Jinkun
    Cheng, Tongtong
    Liang, Jing
    BRAIN SCIENCES, 2022, 12 (06)
  • [38] DiffSwinTr: A diffusion model using 3D Swin Transformer for brain tumor segmentation
    Zhu, Junan
    Zhu, Hongxin
    Jia, Zhaohong
    Ma, Ping
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (03)
  • [39] AST-Net: Lightweight Hybrid Transformer for Multimodal Brain Tumor Segmentation
    Wang, Peixu
    Liu, Shikun
    Peng, Jialin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4623 - 4629
  • [40] 3DUV-NetR+: A 3D hybrid semantic architecture using transformers for brain tumor segmentation with MultiModal MR images
    Aboussaleh, Ilyasse
    Riffi, Jamal
    el Fazazy, Khalid
    Mahraz, Adnane Mohamed
    Tairi, Hamid
    RESULTS IN ENGINEERING, 2024, 21