LoMAE: Simple Streamlined Low-Level Masked Autoencoders for Robust, Generalized, and Interpretable Low-Dose CT Denoising

Cited by: 1
Authors
Wang, Dayang [1]
Han, Shuo [1]
Xu, Yongshun [1]
Wu, Zhan [2, 3]
Zhou, Li [1]
Morovati, Bahareh [1]
Yu, Hengyong [1]
Affiliations
[1] Univ Massachusetts Lowell, Dept Elect & Comp Engn, Lowell, MA 01854 USA
[2] Southeast Univ, Lab Image Sci & Technol, Nanjing 210096, Peoples R China
[3] Southeast Univ, Key Lab Comp Network & Informat Integrat, Minist Educ, Nanjing 210096, Peoples R China
Keywords
Noise reduction; Noise; Transformers; Computed tomography; Decoding; Robustness; Data models; Low-dose CT; masked autoencoder; self-pretraining; transformer; RECONSTRUCTION; ALGORITHMS; NETWORK
DOI
10.1109/JBHI.2024.3454979
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Low-dose computed tomography (LDCT) reduces X-ray radiation exposure at the cost of compromised image quality, characterized by increased noise and artifacts. Recently, transformer models have emerged as a promising avenue to enhance LDCT image quality. However, the success of such models relies on a large amount of paired noisy and clean images, which are often scarce in clinical settings. In computer vision and natural language processing, masked autoencoders (MAE) have been recognized as a powerful self-pretraining method for transformers, due to their exceptional capability to extract representative features. However, the original pretraining and fine-tuning design fails to work in low-level vision tasks like denoising. In response to this challenge, we redesign the classical encoder-decoder learning model and develop a simple yet effective streamlined low-level vision MAE, referred to as LoMAE, tailored to the LDCT denoising problem. Moreover, we introduce an MAE-GradCAM method to shed light on the latent learning mechanisms of the MAE/LoMAE. Additionally, we explore LoMAE's robustness and generalizability across a variety of noise levels. Experimental findings show that the proposed LoMAE enhances the denoising capabilities of the transformer and substantially reduces its dependency on high-quality, ground-truth data. It also demonstrates remarkable robustness and generalizability over a spectrum of noise levels. In summary, the proposed LoMAE provides promising solutions to the major issues in LDCT, including interpretability, ground-truth data dependency, and model robustness/generalizability.
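Illustrative note (not part of the indexed record): the abstract refers to masked autoencoder pretraining, in which an image is split into patches and a random subset is hidden from the encoder. The sketch below shows only that generic masking step in plain Python. The function names `patchify` and `random_patch_mask`, the patch size, and the 75% mask ratio are assumptions chosen for illustration; they are not taken from the LoMAE paper and do not reproduce its architecture.

```python
import random


def patchify(image, patch):
    """Split a 2-D list-of-lists image into non-overlapping
    patch x patch blocks, returned in row-major order."""
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image size must be divisible by patch size")
    blocks = []
    for r in range(0, height, patch):
        for c in range(0, width, patch):
            blocks.append([row[c:c + patch] for row in image[r:r + patch]])
    return blocks


def random_patch_mask(num_patches, mask_ratio, seed=None):
    """Return (visible_idx, masked_idx), two sorted index lists that
    together cover every patch exactly once."""
    if not 0.0 <= mask_ratio < 1.0:
        raise ValueError("mask_ratio must be in [0, 1)")
    rng = random.Random(seed)
    order = list(range(num_patches))
    rng.shuffle(order)
    num_masked = int(num_patches * mask_ratio)
    masked_idx = sorted(order[:num_masked])
    visible_idx = sorted(order[num_masked:])
    return visible_idx, masked_idx


# Hypothetical 8x8 "image" whose pixel value equals its flat index.
image = [[r * 8 + c for c in range(8)] for r in range(8)]
patches = patchify(image, patch=2)              # 16 patches of 2x2
visible_idx, masked_idx = random_patch_mask(len(patches), 0.75, seed=0)

# Only the visible patches would be passed to an encoder; a decoder
# would then be trained to reconstruct the masked ones.
visible_patches = [patches[i] for i in visible_idx]
```

In a generic MAE setup the reconstruction loss is typically computed on the masked patches only; how LoMAE adapts this for denoising is described in the paper itself, not in this sketch.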
Pages: 6815-6827
Page count: 13
Related papers
50 in total
  • [21] Quadratic Autoencoder (Q-AE) for Low-Dose CT Denoising
    Fan, Fenglei
    Shan, Hongming
    Kalra, Mannudeep K.
    Singh, Ramandeep
    Qian, Guhan
    Getzin, Matthew
    Teng, Yueyang
    Hahn, Juergen
    Wang, Ge
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (06) : 2035 - 2050
  • [22] Denoising swin transformer and perceptual peak signal-to-noise ratio for low-dose CT image denoising
    Zhang, Boyan
    Zhang, Yingqi
    Wang, Binjie
    He, Xin
    Zhang, Fan
    Zhang, Xinhong
    MEASUREMENT, 2024, 227
  • [23] Real-World Low-Dose CT Image Denoising by Patch Similarity Purification
    Song, Zeya
    Xue, Liqi
    Xu, Jun
    Zhang, Baoping
    Jin, Chao
    Yang, Jian
    Zou, Changliang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 196 - 208
  • [24] Generative Adversarial Network With Robust Discriminator Through Multi-Task Learning for Low-Dose CT Denoising
    Kyung, Sunggu
    Won, Jongjun
    Pak, Seongyong
    Kim, Sunwoo
    Lee, Sangyoon
    Park, Kanggil
    Hong, Gil-Sun
    Kim, Namkug
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (01) : 499 - 518
  • [25] Reducing the risk of hallucinations with interpretable deep learning models for low-dose CT denoising: comparative performance analysis
    Patwari, Mayank
    Gutjahr, Ralf
    Marcus, Roy
    Thali, Yannick
    Calvarons, Adria F.
    Raupach, Rainer
    Maier, Andreas
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (19)
  • [26] Low-Dose CT Denoising via Neural Architecture Search
    Lu, Zexin
    Xia, Wenjun
    Huang, Yongqiang
    Hou, Mingzheng
    Chen, Hu
    Shan, Hongming
    Zhang, Yi
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022
  • [27] DDT-Net: Dose-Agnostic Dual-Task Transfer Network for Simultaneous Low-Dose CT Denoising and Simulation
    Meng, Mingqiang
    Wang, Yongbo
    Zhu, Manman
    Tao, Xi
    Mao, Zerui
    Liao, Jingyi
    Bian, Zhaoying
    Zeng, Dong
    Ma, Jianhua
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (06) : 3613 - 3625
  • [28] Low-dose CT denoising using a Progressive Wasserstein generative adversarial network
    Wang, Guan
    Hu, Xueli
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135 (135)
  • [29] Artifact and Detail Attention Generative Adversarial Networks for Low-Dose CT Denoising
    Zhang, Xiong
    Han, Zefang
    Shangguan, Hong
    Han, Xinglong
    Cui, Xueying
    Wang, Anhong
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (12) : 3901 - 3918
  • [30] No-Reference Denoising of Low-Dose CT Projections
    Zainulina, Elvira
    Chernyavskiy, Alexey
    Dylov, Dmitry V.
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 77 - 81