TransSea: Hybrid CNN-Transformer With Semantic Awareness for 3-D Brain Tumor Segmentation

Cited by: 8
Authors
Liu, Yu [1, 2]
Ma, Yize [1, 2]
Zhu, Zhiqin [3]
Cheng, Juan [1, 2]
Chen, Xun [4]
Affiliations
[1] Hefei Univ Technol, Dept Biomed Engn, Hefei 230009, Peoples R China
[2] Hefei Univ Technol, Anhui Prov Key Lab Measuring Theory & Precis Instr, Hefei 230009, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Coll Automat, Chongqing 400065, Peoples R China
[4] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230027, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Brain tumor segmentation; convolutional neural networks (CNNs); multimodal magnetic resonance imaging (MRI); semantic guidance (SG); Transformer; U-NET; ATTENTION;
DOI
10.1109/TIM.2024.3413130
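Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract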
Accurate segmentation of brain tumors in multimodal magnetic resonance imaging (MRI) plays a crucial role in clinical quantitative assessments, diagnostic processes, and the planning of therapeutic strategies. Both convolutional neural networks (CNNs) with strong local information extraction capacities and Transformers with excellent global representation capacities have achieved remarkable performance in medical image segmentation. However, considering the inherent semantic disparities between local and global features, effectively combining convolutions and Transformers presents a significant challenge in medical image segmentation. To address this issue, through integrating the merits of these two paradigms in a well-designed encoder-decoder architecture, we propose a hybrid CNN-Transformer network with semantic awareness, named TransSea, for an accurate 3-D brain tumor segmentation task. Our network incorporates a semantic mutual attention (SMA) module at the encoding stage, seamlessly integrating global and local features. Furthermore, our design includes a multiscale semantic guidance (SG) module that introduces semantic priors in the encoder through semantic supervision, enabling focused segmentation in relevant areas. In the decoding process, a semantic integration (SI) module is presented to further integrate various feature mappings from the encoder and semantic priors, thereby enhancing the propagation of semantic information and achieving semantically aware querying. Extensive experiments on two brain tumor datasets, BraTS2020 and BraTS2021, demonstrate that our model significantly outperforms existing state-of-the-art methods. The source code of the proposed method will be made available at https://github.com/yuliu316316.
Pages: 16
Related papers
50 in total
  • [21] Hybrid CNN-Transformer model for medical image segmentation with pyramid convolution and multi-layer perceptron
    Liu, Xiaowei
    Hu, Yikun
    Chen, Jianguo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [22] A 3D-2D Hybrid Network with Regional Awareness and Global Fusion for Brain Tumor Segmentation
    Zhao, Wenxiu
    Dongye, Changlei
    Wang, Yumei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VII, ICIC 2024, 2024, 14868 : 333 - 344
  • [23] Shape-Scale Co-Awareness Network for 3D Brain Tumor Segmentation
    Zhou, Lifang
    Jiang, Yu
    Li, Weisheng
    Hu, Jun
    Zheng, Shenhai
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (07) : 2495 - 2508
  • [24] Enhanced Segmentation in Abdominal CT Images: Leveraging Hybrid CNN-Transformer Architectures and Compound Loss Function
    Piri, Fatemeh
    Karimi, Nader
    Samavi, Shadrokh
    2024 IEEE 5TH ANNUAL WORLD AI IOT CONGRESS, AIIOT 2024, 2024, : 0363 - 0369
  • [25] CNN-transformer dual branch collaborative model for semantic segmentation of high-resolution remote sensing images
    Zhu, Xiaotong
    Peng, Taile
    Guo, Jia
    Wang, Hao
    Cao, Taotao
    PHOTOGRAMMETRIC RECORD, 2025, 40 (189):
  • [26] UTNETPARA: A HYBRID CNN-TRANSFORMER ARCHITECTURE WITH MULTI-SCALE FUSION FOR WHOLE-SLIDE IMAGE SEGMENTATION
    Huang, Boqiang
    Ying, Jiayu
    Lyu, Ruizhi
    Schaadt, Nadine S.
    Klinkhammer, Barbara M.
    Boor, Peter
    Lotz, Johannes
    Feuerhake, Friedrich
    Merhof, Dorit
    IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
  • [27] Multi-resolution 3D CNN for MRI Brain Tumor Segmentation and Survival Prediction
    Amian, Mehdi
    Soltaninejad, Mohammadreza
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES (BRAINLES 2019), PT I, 2020, 11992 : 221 - 230
  • [28] SegTransConv: Transformer and CNN Hybrid Method for Real-Time Semantic Segmentation of Autonomous Vehicles
    Fan, Jiaqi
    Gao, Bingzhao
    Ge, Quanbo
    Ran, Yabing
    Zhang, Jia
    Chu, Hongqing
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1586 - 1601
  • [29] TRANSFORMER AND CNN HYBRID NETWORK FOR SUPER-RESOLUTION SEMANTIC SEGMENTATION OF REMOTE SENSING IMAGERY
    Liu, Yutong
    Gao, Kun
    Wang, Hong
    Wang, Junwei
    Zhang, Xiaodian
    Wang, Pengyu
    Li, Shuzhong
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6940 - 6943
  • [30] MFH-Net: A Hybrid CNN-Transformer Network Based Multi-Scale Fusion for Medical Image Segmentation
    Wang, Ying
    Zhang, Meng
    Liang, Jian'an
    Liang, Meiyan
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (06)