Cascaded Multi-path Shortcut Diffusion Model for Medical Image Translation

被引:1
作者
Zhou, Yinchi [1 ]
Chen, Tianqi [4 ]
Hou, Jun [4 ]
Xie, Huidong [1 ]
Dvornek, Nicha C. [1 ,2 ]
Zhou, S. Kevin [5 ,6 ]
Wilson, David L. [7 ]
Duncan, James S. [1 ,2 ,3 ]
Liu, Chi [1 ,2 ]
Zhou, Bo [8 ]
机构
[1] Yale Univ, Dept Biomed Engn, New Haven, CT USA
[2] Yale Sch Med, Dept Radiol & Biomed Imaging, New Haven, CT USA
[3] Yale Univ, Dept Elect Engn, New Haven, CT USA
[4] Univ Calif Irvine, Dept Comp Sci, Irvine, CA USA
[5] Univ Sci & Technol China, Sch Biomed Engn, Suzhou, Peoples R China
[6] Univ Sci & Technol China, Suzhou Inst Adv Res, Suzhou, Peoples R China
[7] Case Western Reserve Univ, Dept Biomed Engn, Cleveland Hts, OH USA
[8] Northwestern Univ, Dept Radiol, Chicago, IL 60208 USA
基金
美国国家卫生研究院;
关键词
Image translation; Diffusion model; Uncertainty; Cascade framework; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1016/j.media.2024.103300
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image-to-image translation is a vital component in medical imaging processing, with many uses in a wide range of imaging modalities and clinical scenarios. Previous methods include Generative Adversarial Networks (GANs) and Diffusion Models (DMs), which offer realism but suffer from instability and lack uncertainty estimation. Even though both GAN and DM methods have individually exhibited their capability in medical image translation tasks, the potential of combining a GAN and DM to further improve translation performance and to enable uncertainty estimation remains largely unexplored. In this work, we address these challenges by proposing a Cascade Multi-path Shortcut Diffusion Model (CMDM) for high-quality medical image translation and uncertainty estimation. To reduce the required number of iterations and ensure robust performance, our method first obtains a conditional GAN-generated prior image that will be used for the efficient reverse translation with a DM in the subsequent step. Additionally, a multi-path shortcut diffusion strategy is employed to refine translation results and estimate uncertainty. A cascaded pipeline further enhances translation quality, incorporating residual averaging between cascades. We collected three different medical image datasets with two sub-tasks for each dataset to test the generalizability of our approach. Our experimental results found that CMDM can produce high-quality translations comparable to state-of-the-art methods while providing reasonable uncertainty estimations that correlate well with the translation error.
引用
收藏
页数:9
相关论文
共 49 条
[1]  
Bieder F., 2024, Medical Imaging With Deep Learning, P552
[2]  
Chen TQ, 2024, Arxiv, DOI arXiv:2406.08374
[3]   Cross-vender, cross-tracer, and cross-protocol deep transfer learning for attenuation map generation of cardiac SPECT [J].
Chen, Xiongchao ;
Hendrik Pretorius, P. ;
Zhou, Bo ;
Liu, Hui ;
Johnson, Karen ;
Liu, Yi-Hwa ;
King, Michael A. ;
Liu, Chi .
JOURNAL OF NUCLEAR CARDIOLOGY, 2022, 29 (06) :3379-3391
[4]   Direct and indirect strategies of deep-learning-based attenuation correction for general purpose and dedicated cardiac SPECT [J].
Chen, Xiongchao ;
Zhou, Bo ;
Xie, Huidong ;
Shi, Luyao ;
Liu, Hui ;
Holler, Wolfgang ;
Lin, MingDe ;
Liu, Yi-Hwa ;
Miller, Edward J. ;
Sinusas, Albert J. ;
Liu, Chi .
EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2022, 49 (09) :3046-3060
[5]   Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models [J].
Chung, Hyungjin ;
Ryu, Dohoon ;
Mccann, Michael T. ;
Klasky, Marc L. ;
Ye, Jong Chul .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22542-22551
[6]   Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction [J].
Chung, Hyungjin ;
Sim, Byeongsu ;
Ye, Jong Chul .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12403-12412
[7]   MR-contrast-aware image-to-image translations with generative adversarial networks [J].
Denck, Jonas ;
Guehring, Jens ;
Maier, Andreas ;
Rothgang, Eva .
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2021, 16 (12) :2069-2078
[8]  
Gal Y, 2016, PR MACH LEARN RES, V48
[9]   CoreDiff: Contextual Error-Modulated Generalized Diffusion Model for Low-Dose CT Denoising and Generalization [J].
Gao, Qi ;
Li, Zilong ;
Zhang, Junping ;
Zhang, Yi ;
Shan, Hongming .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (02) :745-759
[10]   PET image denoising based on denoising diffusion probabilistic model [J].
Gong, Kuang ;
Johnson, Keith ;
El Fakhri, Georges ;
Li, Quanzheng ;
Pan, Tinsu .
EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2024, 51 (02) :358-368