Cascaded Multi-path Shortcut Diffusion Model for Medical Image Translation

被引：1

作者：

Zhou, Yinchi ^{[1
]}

Chen, Tianqi ^{[4
]}

Hou, Jun ^{[4
]}

Xie, Huidong ^{[1
]}

Dvornek, Nicha C. ^{[1
,2
]}

Zhou, S. Kevin ^{[5
,6
]}

Wilson, David L. ^{[7
]}

Duncan, James S. ^{[1
,2
,3
]}

Liu, Chi ^{[1
,2
]}

Zhou, Bo ^{[8
]}

机构：

[1] Yale Univ, Dept Biomed Engn, New Haven, CT USA

[2] Yale Sch Med, Dept Radiol & Biomed Imaging, New Haven, CT USA

[3] Yale Univ, Dept Elect Engn, New Haven, CT USA

[4] Univ Calif Irvine, Dept Comp Sci, Irvine, CA USA

[5] Univ Sci & Technol China, Sch Biomed Engn, Suzhou, Peoples R China

[6] Univ Sci & Technol China, Suzhou Inst Adv Res, Suzhou, Peoples R China

[7] Case Western Reserve Univ, Dept Biomed Engn, Cleveland Hts, OH USA

[8] Northwestern Univ, Dept Radiol, Chicago, IL 60208 USA

来源：

MEDICAL IMAGE ANALYSIS | 2024年 / 98卷

基金：

美国国家卫生研究院;

关键词：

Image translation; Diffusion model; Uncertainty; Cascade framework; CONVOLUTIONAL NEURAL-NETWORKS;

D O I：

10.1016/j.media.2024.103300

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image-to-image translation is a vital component in medical imaging processing, with many uses in a wide range of imaging modalities and clinical scenarios. Previous methods include Generative Adversarial Networks (GANs) and Diffusion Models (DMs), which offer realism but suffer from instability and lack uncertainty estimation. Even though both GAN and DM methods have individually exhibited their capability in medical image translation tasks, the potential of combining a GAN and DM to further improve translation performance and to enable uncertainty estimation remains largely unexplored. In this work, we address these challenges by proposing a Cascade Multi-path Shortcut Diffusion Model (CMDM) for high-quality medical image translation and uncertainty estimation. To reduce the required number of iterations and ensure robust performance, our method first obtains a conditional GAN-generated prior image that will be used for the efficient reverse translation with a DM in the subsequent step. Additionally, a multi-path shortcut diffusion strategy is employed to refine translation results and estimate uncertainty. A cascaded pipeline further enhances translation quality, incorporating residual averaging between cascades. We collected three different medical image datasets with two sub-tasks for each dataset to test the generalizability of our approach. Our experimental results found that CMDM can produce high-quality translations comparable to state-of-the-art methods while providing reasonable uncertainty estimations that correlate well with the translation error.

引用

页数：9

共 49 条

[1]

Bieder F., 2024, Medical Imaging With Deep Learning, P552

[2]

Chen TQ, 2024, Arxiv, DOI arXiv:2406.08374

[3] Cross-vender, cross-tracer, and cross-protocol deep transfer learning for attenuation map generation of cardiac SPECT [J].