Multi-teacher cross-modal distillation with cooperative deep supervision fusion learning for unimodal segmentation

被引：8

作者：

Ahmad, Saeed ^{[1
]}

Ullah, Zahid ^{[1
]}

Gwak, Jeonghwan ^{[1
,2
,3
,4
]}

机构：

[1] Korea Natl Univ Transportat, Dept Software, Chungju 27469, South Korea

[2] Korea Natl Univ Transportat, Dept IT Energy Convergence BK21 FOUR, Chungju 27469, South Korea

[3] Korea Natl Univ Transportat, Dept Biomed Engn, Chungju 27469, South Korea

[4] Korea Natl Univ Transportat, Dept AI Robot Engn, Chungju 27469, South Korea

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 297卷

基金：

新加坡国家研究基金会;

关键词：

Brain tumor segmentation; Knowledge distillation; Cooperative learning; Feature fusion; Multi-teacher framework; SEMANTICS;

D O I：

10.1016/j.knosys.2024.111854

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Accurate brain tumor segmentation is a labor-intensive and time-consuming task that requires automation to enhance its efficacy. Recent advanced techniques have shown promising results in segmenting brain tumors; however, their dependency on extensive multimodal magnetic resonance imaging (MRI) data limits their practicality in clinical environments where such data may not be readily available. To address this, we propose a novel multi-teacher cross-modal knowledge distillation framework, which utilizes the privileged multimodal data during training while relying solely on unimodal data for inference. Our framework is tailored to the unimodal segmentation of the T 1ce MRI sequence, which is prevalently available in clinical practice and structurally akin to the T1 1 modality, providing ample information for the segmentation task. Our framework introduces two learning strategies for knowledge distillation (KD): (1) performance-aware response-based KD and (2) cooperative deep supervision fusion learning (CDSFL). The first strategy involves dynamically assigning confidence weights to each teacher model based on its performance, ensuring that the KD is performance- driven, and the CDSFL module augments the learning capabilities of the multi-teacher models by fostering mutual learning. Moreover, the fused information is distilled into the student model to improve its deep supervision. Extensive experiments on BraTS datasets demonstrate that our framework achieves promising unimodal segmentation results on the T 1ce and T1 1 modalities and outperforms previous state-of-the-art methods. Code is available at https://github.com/ami-lab-knut/mtcm_kd.

引用

页数：13

共 49 条

[1]

Azad R, 2022, PR MACH LEARN RES, V172, P48

[2] Synthesis of Positron Emission Tomography (PET) Images via Multi-channel Generative Adversarial Networks (GANs) [J].

Bi, Lei ;

Kim, Jinman ;

Kumar, Ashnil ;

Feng, Dagan ;

Fulham, Michael .

MOLECULAR IMAGING, RECONSTRUCTION AND ANALYSIS OF MOVING BODY ORGANS, AND STROKE IMAGING AND TREATMENT, 2017, 10555 :43-51

[3]

Bucilua Cristian, 2006, PROC 12 ACM SIGKDD I, P535, DOI DOI 10.1145/1150402.1150464

[4] MRI segmentation fusion for brain tumor detection [J].

Cabria, Ivan ;

Gondra, Iker .

INFORMATION FUSION, 2017, 36 :1-9

[5] Learning With Privileged Multimodal Knowledge for Unimodal Segmentation [J].

Chen, Cheng ;

Dou, Qi ;

Jin, Yueming ;

Liu, Quande ;

Heng, Pheng Ann .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (03) :621-632

[6] A single stage knowledge distillation network for brain tumor segmentation on limited MR image modalities [J].

Choi, Yoonseok ;

Al-masni, Mohammed A. ;

Jung, Kyu-Jin ;

Yoo, Roh-Eul ;

Lee, Seong-Yeong ;

Kim, Dong-Hyun .

COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 240

[7] Image Synthesis in Multi-Contrast MRI With Conditional Generative Adversarial Networks [J].

Dar, Salman U. H. ;

Yurt, Mahmut ;

Karacan, Levent ;

Erdem, Aykut ;

Erdem, Erkut ;

Cukur, Tolga .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (10) :2375-2388

[8] RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor Segmentation [J].

Ding, Yuhang ;

Yu, Xin ;

Yang, Yi .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3955-3964

[9] Hetero-Modal Variational Encoder-Decoder for Joint Modality Completion and Segmentation [J].

Dorent, Reuben ;

Joutard, Samuel ;

Modat, Marc ;

Ourselin, Sebastien ;

Vercauteren, Tom .

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 :74-82

[10] Unpaired Multi-Modal Segmentation via Knowledge Distillation [J].

Dou, Qi ;

Liu, Quande ;

Heng, Pheng Ann ;

Glocker, Ben .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (07) :2415-2425

← 1 2 3 4 5 →