LoMAE: Simple Streamlined Low-Level Masked Autoencoders for Robust, Generalized, and Interpretable Low-Dose CT Denoising

被引:1
|
作者
Wang, Dayang [1 ]
Han, Shuo [1 ]
Xu, Yongshun [1 ]
Wu, Zhan [2 ,3 ]
Zhou, Li [1 ]
Morovati, Bahareh [1 ]
Yu, Hengyong [1 ]
机构
[1] Univ Massachusetts Lowell, Dept Elect & Comp Engn, Lowell, MA 01854 USA
[2] Southeast Univ, Lab Image Sci & Technol, Nanjing 210096, Peoples R China
[3] Southeast Univ, Key Lab Comp Network & Informat Integrat, Minist Educ, Nanjing 210096, Peoples R China
关键词
Noise reduction; Noise; Transformers; Computed tomography; Decoding; Robustness; Data models; Low-dose CT; masked autoencoder; self-pretraining; transformer; RECONSTRUCTION; ALGORITHMS; NETWORK;
D O I
10.1109/JBHI.2024.3454979
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Low-dose computed tomography (LDCT) offers reduced X-ray radiation exposure but at the cost of compromised image quality, characterized by increased noise and artifacts. Recently, transformer models emerged as a promising avenue to enhance LDCT image quality. However, the success of such models relies on a large amount of paired noisy and clean images, which are often scarce in clinical settings. In computer vision and natural language processing, masked autoencoders (MAE) have been recognized as a powerful self-pretraining method for transformers, due to their exceptional capability to extract representative features. However, the original pretraining and fine-tuning design fails to work in low-level vision tasks like denoising. In response to this challenge, we redesign the classical encoder-decoder learning model and facilitate a simple yet effective streamlined low-level vision MAE, referred to as LoMAE, tailored to address the LDCT denoising problem. Moreover, we introduce an MAE-GradCAM method to shed light on the latent learning mechanisms of the MAE/LoMAE. Additionally, we explore the LoMAE's robustness and generability across a variety of noise levels. Experimental findings show that the proposed LoMAE enhances the denoising capabilities of the transformer and substantially reduce their dependency on high-quality, ground-truth data. It also demonstrates remarkable robustness and generalizability over a spectrum of noise levels. In summary, the proposed LoMAE provides promising solutions to the major issues in LDCT including interpretability, ground truth data dependency, and model robustness/generalizability.
引用
收藏
页码:6815 / 6827
页数:13
相关论文
共 50 条
  • [1] Masked Autoencoders for Low-dose CT Denoising
    Wang, Dayang
    Xu, Yongshun
    Han, Shuo
    Yu, Hengyong
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [2] Low-dose CT restoration via stacked sparse denoising autoencoders
    Liu, Yan
    Zhang, Yi
    NEUROCOMPUTING, 2018, 284 : 80 - 89
  • [3] Robust Denoising of Low-Dose CT Images using Convolutional Neural Networks
    Nguyen Thanh Trung
    Trinh Dinh Hoan
    Nguyen Linh Trung
    Luu Manh Ha
    PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 506 - 511
  • [4] Low-Dose CT Denoising via Sinogram Inner-Structure Transformer
    Yang, Liutao
    Li, Zhongnian
    Ge, Rongjun
    Zhao, Junyong
    Si, Haipeng
    Zhang, Daoqiang
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (04) : 910 - 921
  • [5] RHLNet: Robust Hybrid Loss-Based Network for Low-Dose CT Image Denoising
    Saidulu, Naragoni
    Muduli, Priya Ranjan
    Dasgupta, Anirban
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [6] A Review of deep learning methods for denoising of medical low-dose CT images
    Zhang, Ju
    Gong, Weiwei
    Ye, Lieli
    Wang, Fanghong
    Shangguan, Zhibo
    Cheng, Yun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 171
  • [7] LOW-DOSE CT DENOISING WITH CONVOLUTIONAL NEUELA NETWORK
    Chen, Hu
    Zhang, Yi
    Zhang, Weihua
    Liao, Peixi
    Li, Ke
    Zhou, Jiliu
    Wang, Ge
    2017 IEEE 14TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2017), 2017, : 143 - 146
  • [8] CoreDiff: Contextual Error-Modulated Generalized Diffusion Model for Low-Dose CT Denoising and Generalization
    Gao, Qi
    Li, Zilong
    Zhang, Junping
    Zhang, Yi
    Shan, Hongming
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (02) : 745 - 759
  • [9] SwinCT: feature enhancement based low-dose CT images denoising with swin transformer
    Jian, Muwei
    Yu, Xiaoyang
    Zhang, Haoran
    Yang, Chengdong
    MULTIMEDIA SYSTEMS, 2024, 30 (01)
  • [10] AdaIN-Based Tunable CycleGAN for Efficient Unsupervised Low-Dose CT Denoising
    Gu, Jawook
    Ye, Jong Chul
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2021, 7 : 73 - 85