CoreDiff: Contextual Error-Modulated Generalized Diffusion Model for Low-Dose CT Denoising and Generalization

被引:26
作者
Gao, Qi [1 ,2 ,3 ]
Li, Zilong [4 ]
Zhang, Junping [4 ]
Zhang, Yi [5 ]
Shan, Hongming [1 ,2 ,3 ]
机构
[1] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, Shanghai 200433, Peoples R China
[2] Fudan Univ, MOE Frontiers Ctr Brain Sci, Shanghai 200433, Peoples R China
[3] Shanghai Ctr Brain Sci & Brain Inspired Technol, Shanghai 201602, Peoples R China
[4] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
[5] Sichuan Univ, Sch Cyber Sci & Engn, Chengdu 610065, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Low-dose CT; denoising; diffusion model; one-shot learning; COMPUTED-TOMOGRAPHY; IMAGE; NETWORK; PERFORMANCE; RECONSTRUCTION; REDUCTION;
D O I
10.1109/TMI.2023.3320812
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Low-dose computed tomography (CT) images suffer from noise and artifacts due to photon starvation and electronic noise. Recently, some works have attempted to use diffusion models to address the over-smoothness and training instability encountered by previous deep-learning-based denoising models. However, diffusion models suffer from long inference time due to a large number of sampling steps involved. Very recently, cold diffusion model generalizes classical diffusion models and has greater flexibility. Inspired by cold diffusion, this paper presents a novel COntextual eRror-modulated gEneralized Diffusion model for low-dose CT (LDCT) denoising, termed CoreDiff. First, CoreDiff utilizes LDCT images to displace the random Gaussian noise and employs a novel mean-preserving degradation operator to mimic the physical process of CT degradation, significantly reducing sampling steps thanks to the informative LDCT images as the starting point of the sampling process. Second, to alleviate the error accumulation problem caused by the imperfect restoration operator in the sampling process, we propose a novel ContextuaL Error-modulAted Restoration Network (CLEAR-Net), which can leverage contextual information to constrain the sampling process from structural distortion and modulate time step embedding features for better alignment with the input at the next time step. Third, to rapidly generalize the trained model to a new, unseen dose level with as few resources as possible, we devise a one-shot learning framework to make CoreDiff generalize faster and better using only one single LDCT image (un)paired with normal-dose CT (NDCT). Extensive experimental results on four datasets demonstrate that our CoreDiff outperforms competing methods in denoising and generalization performance, with clinically acceptable inference time. Source code is made available at https://github.com/qgao21/CoreDiff.
引用
收藏
页码:745 / 759
页数:15
相关论文
共 64 条
  • [1] K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation
    Aharon, Michal
    Elad, Michael
    Bruckstein, Alfred
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) : 4311 - 4322
  • [2] Bansal A, 2023, ADV NEUR IN
  • [3] An Open Library of CT Patient Projection Data
    Chen, Baiyu
    Leng, Shuai
    Yu, Lifeng
    Holmes, David, III
    Fletcher, Joel
    McCollough, Cynthia
    [J]. MEDICAL IMAGING 2016: PHYSICS OF MEDICAL IMAGING, 2016, 9783
  • [4] Low-Dose CT With a Residual Encoder-Decoder Convolutional Neural Network
    Chen, Hu
    Zhang, Yi
    Kalra, Mannudeep K.
    Lin, Feng
    Chen, Yang
    Liao, Peixi
    Zhou, Jiliu
    Wang, Ge
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (12) : 2524 - 2535
  • [5] Improving abdomen tumor low-dose CT images using a fast dictionary learning based processing
    Chen, Yang
    Yin, Xindao
    Shi, Luyao
    Shu, Huazhong
    Luo, Limin
    Coatrieux, Jean-Louis
    Toumoulin, Christine
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2013, 58 (16) : 5803 - 5820
  • [6] Chung H., 2021, P 36 C NEUR INF PROC, P1
  • [7] Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models
    Chung, Hyungjin
    Ryu, Dohoon
    Mccann, Michael T.
    Klasky, Marc L.
    Ye, Jong Chul
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22542 - 22551
  • [8] Image quality assessment based on a degradation model
    Damera-Venkata, N
    Kite, TD
    Geisler, WS
    Evans, BL
    Bovik, AC
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (04) : 636 - 650
  • [9] Dhariwal P, 2021, ADV NEUR IN, V34
  • [10] Dumoulin Vincent, 2017, ICLR