Multi-scale perceptual modulation network for low-dose computed tomography denoising

被引:0
作者
Huang, Jiexing [1 ]
Zhong, Anni [2 ]
Liu, Yujian [1 ]
机构
[1] Sun Yat Sen Univ, Affiliated Hosp 1, Dept Radiat Oncol, 58 Zhongshan Er Rd, Guangzhou 510080, Peoples R China
[2] Sun Yat Sen Univ, Affiliated Hosp 6, Dept Informat Technol, Guangzhou, Peoples R China
关键词
Low-dose computed tomography (LDCT); denoising; decomposable convolution; multi-scale perceptual; modulation; CT RECONSTRUCTION; NOISE-REDUCTION;
D O I
10.21037/qims-24-1145
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Low-dose computed tomography (LDCT) reduces radiation exposure, but the introduced noise and artifacts impair its diagnostic accuracy. Convolutional neural networks (CNNs) are widely used for LDCT denoising, but they suffer from a limited receptive field. The use of a larger kernel size can enlarge the receptive field and boost model performance; however, the computational cost of the model greatly increases. We aimed to develop a LDCT denoising CNN with a large receptive field and lower computational complexity. Methods: We developed a multi-scale perceptual modulation network (MSPMnet) incorporating a powerful multi-head decomposable convolution (MHDC). To address the high computational complexity of large kernel convolutions, we developed a novel MHDC module that can capture multi-scale features and efficiently expand the receptive field. The MHDC module couples maximum-pooling with three depth- wise convolutions of varying kernel sizes via a channel splitting mechanism, where, unlike conventional CNNs, the two large two-dimensional kernels are each decomposed into a set of cascaded orthogonal one-dimensional kernels to remain lightweight. Further, departing from prior methodologies that apply a uniform kernel size throughout the network, we introduced a receptive field-ramp mechanism that adeptly transitions from local to relatively long-range dependency modeling as the network depth increases, thereby achieving superior performance. Results: The proposed MSPMnet was evaluated on a Mayo Clinic data set with a conventional iterative algorithm, two CNN models, and two Transformer models used for comparison. Compared to the competing baseline methods, the MSPMnet exhibited better performance in both the visual and quantitative assessments. Visually, the MSPMnet preserved the structure, edges, and textures with excellent noise and artifact reduction, generating the denoised images closest to normal-dose computed tomography images. Quantitatively, the MSPMnet had the lowest root mean-square error (RMSE) (8.3094 +/- 1.9325) and the highest peak signal-to-noise ratio (PSNR) (33.8525 +/- 1.8213 dB), structural similarity index (SSIM) (0.9309 +/- 0.0272), and feature similarity index (FSIM) (0.9699 +/- 0.0113), demonstrating superior denoising performance. Conclusions: The proposed MSPMnet excelled at LDCT denoising, effectively removing noise and artifacts while preserving edges. Compared to the state-of-the-art CNNs and Transformers, the proposed MSPMnet exhibited superior denoising performance both quantitatively and qualitatively.
引用
收藏
页码:9290 / 9305
页数:16
相关论文
共 38 条
[21]   DSML-UNet: Depthwise separable convolution network with multiscale large kernel for medical image segmentation [J].
Wang, Biao ;
Qin, Juan ;
Lv, Lianrong ;
Cheng, Mengdan ;
Li, Lei ;
He, Junjie ;
Li, Dingyao ;
Xia, Dan ;
Wang, Meng ;
Ren, Haiping ;
Wang, Shike .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 97
[22]   CTformer: convolution-free Token2Token dilated vision transformer for low-dose CT denoising [J].
Wang, Dayang ;
Fan, Fenglei ;
Wu, Zhan ;
Liu, Rui ;
Wang, Fei ;
Yu, Hengyong .
PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (06)
[23]   Low-dose CT denoising using a Progressive Wasserstein generative adversarial network [J].
Wang, Guan ;
Hu, Xueli .
COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135 (135)
[24]   Penalized weighted least-squares approach to sinogram noise reduction and image reconstruction for low-dose X-ray computed tomography [J].
Wang, Jing ;
Li, Tianfang ;
Lu, Hongbing ;
Liang, Zhengrong .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2006, 25 (10) :1272-1283
[25]   Image quality assessment: From error visibility to structural similarity [J].
Wang, Z ;
Bovik, AC ;
Sheikh, HR ;
Simoncelli, EP .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (04) :600-612
[26]   Generative Adversarial Networks for Noise Reduction in Low-Dose CT [J].
Wolterink, Jelmer M. ;
Leiner, Tim ;
Viergever, Max A. ;
Isgum, Ivana .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (12) :2536-2545
[27]  
Wu DF, 2017, Arxiv, DOI arXiv:1705.04267
[28]   Low-Dose X-ray CT Reconstruction via Dictionary Learning [J].
Xu, Qiong ;
Yu, Hengyong ;
Mou, Xuanqin ;
Zhang, Lei ;
Hsieh, Jiang ;
Wang, Ge .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2012, 31 (09) :1682-1697
[29]  
Yang Jianwei, 2022, Focal Modulation Networks
[30]   Low-Dose CT Denoising via Sinogram Inner-Structure Transformer [J].
Yang, Liutao ;
Li, Zhongnian ;
Ge, Rongjun ;
Zhao, Junyong ;
Si, Haipeng ;
Zhang, Daoqiang .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (04) :910-921