Multimodal invariant feature prompt network for brain tumor segmentation with modalities

被引:2
作者
Diao, Yueqin [1 ,2 ,4 ]
Fang, Huihui [3 ]
Yu, Hanyi [2 ]
Li, Fan [1 ]
Xu, Yanwu [2 ,4 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Peoples R China
[2] South China Univ Technol, Sch Future Technol, Guangzhou 510006, Peoples R China
[3] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore 639798, Singapore
[4] Pazhou Lab, Guangzhou 510320, Peoples R China
关键词
Brain tumor segmentation; Incomplete multimodal; Modality feature prompts; Modality-aware fusion; COMPLETION;
D O I
10.1016/j.neucom.2024.128847
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate brain tumor image segmentation can significantly assist doctors in clinical diagnosis and treatment. The complementary information from multimodal magnetic resonance imaging provides more comprehensive information on brain tumors for lesion segmentation. However, the complexity of clinical scenarios may lead to modal loss or damage, thereby reducing segmentation performance. Therefore, brain tumor image segmentation becomes a challenging task in the absence of modalities. In recent years, leveraging interactive mechanisms to learn shared feature representations across multiple modalities has become a dominant approach, especially for enhancing segmentation accuracy in scenarios with incomplete modalities. However, a significant challenge in most existing methods is their inability to effectively capture complete multimodal information due to the loss of modality-specific details during interaction processes. To overcome this challenge, we introduce a novel network, referred to as the Multimodal Invariant Feature Prompt Network (MIFPN), which enhances learning by incorporating modality prompts within multimodal interactions, thus ensuring a more comprehensive acquisition of modality information. This network aims to introduce modality- invariant feature prompts in modal information interaction, guiding the model to learn the missing modality information and facilitating integration. To fully leverage modal information, we learned both modality- invariant and modality-specific information, and fused them during the prompt learning process to obtain amore comprehensive understanding of modal information. Considering the different contributions of various modalities to the segmentation results, we designed modality-aware masks and modality selection strategies to merge shallow features of the encoder. MIFPN effectively mitigates the brain tumor image segmentation problem under missing modalities by learning from missing modality prompts and modality-aware fusion. Extensive experiments conducted on the BraTS2020 and BraTS2018 datasets show that our method outperforms state-of-the-art methods inmost scenarios across 15 missing modality configurations. The code is available at https://github.com/diaoyq121/MIFPN.
引用
收藏
页数:13
相关论文
共 52 条
[1]  
Azad R, 2022, PR MACH LEARN RES, V172, P48
[2]  
Bray F, 2018, CA-CANCER J CLIN, V68, P394, DOI [10.3322/caac.21609, 10.3322/caac.21492]
[3]  
Brown TB, 2020, ADV NEUR IN, V33
[4]   Robust Multimodal Brain Tumor Segmentation via Feature Disentanglement and Gated Fusion [J].
Chen, Cheng ;
Dou, Qi ;
Jin, Yueming ;
Chen, Hao ;
Qin, Jing ;
Pheng-Ann Heng .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT III, 2019, 11766 :447-456
[5]   Joint learning-based feature reconstruction and enhanced network for incomplete multi-modal brain tumor segmentation [J].
Diao, Yueqin ;
Li, Fan ;
Li, Zhiyuan .
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163
[6]   RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor Segmentation [J].
Ding, Yuhang ;
Yu, Xin ;
Yang, Yi .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3955-3964
[7]   Hetero-Modal Variational Encoder-Decoder for Joint Modality Completion and Segmentation [J].
Dorent, Reuben ;
Joutard, Samuel ;
Modat, Marc ;
Ourselin, Sebastien ;
Vercauteren, Tom .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 :74-82
[8]   GP-GAN: Brain tumor growth prediction using stacked 3D generative adversarial networks from longitudinal MR Images [J].
Elazab, Ahmed ;
Wang, Changmiao ;
Gardezi, Syed Jamal Safdar ;
Bai, Hongmin ;
Hu, Qingmao ;
Wang, Tianfu ;
Chang, Chunqi ;
Lei, Baiying .
NEURAL NETWORKS, 2020, 132 :321-332
[9]   DAUnet: A U-shaped network combining deep supervision and attention for brain tumor segmentation [J].
Feng, Yan ;
Cao, Yuan ;
An, Dianlong ;
Liu, Panpan ;
Liao, Xingyu ;
Yu, Bin .
KNOWLEDGE-BASED SYSTEMS, 2024, 285
[10]   UNETR: Transformers for 3D Medical Image Segmentation [J].
Hatamizadeh, Ali ;
Tang, Yucheng ;
Nath, Vishwesh ;
Yang, Dong ;
Myronenko, Andriy ;
Landman, Bennett ;
Roth, Holger R. ;
Xu, Daguang .
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :1748-1758