Mutual Information Guided Diffusion for Zero-Shot Cross-Modality Medical Image Translation

被引：11

作者：

Wang, Zihao ^{[1
,2
]}

Yang, Yingyu ^{[3
]}

Chen, Yuzhou ^{[4
]}

Yuan, Tingting ^{[5
]}

Sermesant, Maxime ^{[3
]}

Delingette, Herve ^{[3
]}

Wu, Ona ^{[1
,2
]}

机构：

[1] Massachusetts Gen Hosp, Athinoula A Martinos Ctr Biomed Imaging, Boston, MA 02129 USA

[2] Harvard Univ, Boston, MA 02129 USA

[3] Univ Cote Azur, Inria Ctr, F-06902 Valbonne, France

[4] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA

[5] Georg August Univ Gottingen, Inst Comp Sci, D-37073 Gottingen, Germany

来源：

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2024年 / 43卷 / 08期

基金：

美国国家卫生研究院;

关键词：

Task analysis; Mutual information; Training; Magnetic resonance imaging; Computational modeling; Mathematical models; Generative adversarial networks; Zero-shot learning; cross-modality translation; diffusion model; mutual information; GENERATION; MR;

D O I：

10.1109/TMI.2024.3382043

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Cross-modality data translation has attracted great interest in medical image computing. Deep generative models show performance improvement in addressing related challenges. Nevertheless, as a fundamental challenge in image translation, the problem of zero-shot learning cross-modality image translation with fidelity remains unanswered. To bridge this gap, we propose a novel unsupervised zero-shot learning method called Mutual Information guided Diffusion Model, which learns to translate an unseen source image to the target modality by leveraging the inherent statistical consistency of Mutual Information between different modalities. To overcome the prohibitive high dimensional Mutual Information calculation, we propose a differentiable local-wise mutual information layer for conditioning the iterative denoising process. The Local-wise-Mutual-Information-Layer captures identical cross-modality features in the statistical domain, offering diffusion guidance without relying on direct mappings between the source and target domains. This advantage allows our method to adapt to changing source domains without the need for retraining, making it highly practical when sufficient labeled source domain data is not available. We demonstrate the superior performance of MIDiffusion in zero-shot cross-modality translation tasks through empirical comparisons with other generative models, including adversarial-based and diffusion-based models. Finally, we showcase the real-world application of MIDiffusion in 3D zero-shot learning-based cross-modality image segmentation tasks.

引用

页码：2825 / 2838

页数：14

共 88 条

[1] Image2StyleGAN++: How to Edit the Embedded Images? [J].

Abdal, Rameen ;

Qin, Yipeng ;

Wonka, Peter .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8293-8302

[2]

ANDERSON BD, 1982, STOCH PROC APPL, V12, P313, DOI DOI 10.1016/0304-4149(82)90051-5

[3] Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation [J].

Arar, Moab ;

Ginger, Yiftach ;

Danon, Dov ;

Bermano, Amit H. ;

Cohen-Or, Daniel .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13407-13416

[4] MedGAN: Medical image translation using GANs [J].

Armanious, Karim ;

Jiang, Chenming ;

Fischer, Marc ;

Kuestner, Thomas ;

Nikolaou, Konstantin ;

Gatidis, Sergios ;

Yang, Bin .

COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2020, 79

[5] Unsupervised Domain Adaptation by Domain Invariant Projection [J].

Baktashmotlagh, Mahsa ;

Harandi, Mehrtash T. ;

Lovell, Brian C. ;

Salzmann, Mathieu .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :769-776

[6]

Batzolis G, 2022, Arxiv, DOI arXiv:2207.09786

[7] Robust machine learning segmentation for large-scale analysis of heterogeneous clinical brain MRI datasets [J].

Billot, Benjamin ;

Magdamo, Colin ;

Cheng, You ;

Arnold, Steven E. ;

Das, Sudeshna ;

Iglesias, Juan Eugenio .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (09)

[8]

Brock Andrew, 2017, ARXIV170805344

[9]

Chaudhuri U., 2020, P IEEE CVPRW OCT, P182

[10] General Image-to-Image Translation with One-Shot Image Guidance [J].

Cheng, Bin ;

Liu, Zuhao ;

Peng, Yunbo ;

Lin, Yue .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :22679-22689

← 1 2 3 4 5 6 7 8 9 →