Multi-scale perceptual modulation network for low-dose computed tomography denoising

Times cited: 0
Authors
Huang, Jiexing [1]
Zhong, Anni [2]
Liu, Yujian [1]
Affiliations
[1] Sun Yat Sen Univ, Affiliated Hosp 1, Dept Radiat Oncol, 58 Zhongshan Er Rd, Guangzhou 510080, Peoples R China
[2] Sun Yat Sen Univ, Affiliated Hosp 6, Dept Informat Technol, Guangzhou, Peoples R China
Keywords
Low-dose computed tomography (LDCT); denoising; decomposable convolution; multi-scale perceptual; modulation; CT RECONSTRUCTION; NOISE-REDUCTION;
DOI
10.21037/qims-24-1145
Chinese Library Classification number
R8 [Special medicine]; R445 [Diagnostic imaging];
Discipline classification code
1002; 100207; 1009;
Abstract
Background: Low-dose computed tomography (LDCT) reduces radiation exposure, but the noise and artifacts it introduces impair diagnostic accuracy. Convolutional neural networks (CNNs) are widely used for LDCT denoising, but they suffer from a limited receptive field. A larger kernel size enlarges the receptive field and can boost model performance, but it greatly increases the computational cost of the model. We aimed to develop an LDCT denoising CNN with a large receptive field and lower computational complexity.

Methods: We developed a multi-scale perceptual modulation network (MSPMnet) built around a multi-head decomposable convolution (MHDC) module. The MHDC module addresses the high computational complexity of large-kernel convolutions: it captures multi-scale features and expands the receptive field efficiently. It couples maximum-pooling with three depth-wise convolutions of varying kernel sizes via a channel splitting mechanism. Unlike conventional CNNs, the two large two-dimensional kernels are each decomposed into a set of cascaded orthogonal one-dimensional kernels so that the module remains lightweight. Further, departing from prior methods that apply a uniform kernel size throughout the network, we introduced a receptive field-ramp mechanism that transitions from local to relatively long-range dependency modeling as the network depth increases, thereby achieving superior performance.

Results: The proposed MSPMnet was evaluated on a Mayo Clinic data set and compared against a conventional iterative algorithm, two CNN models, and two Transformer models. The MSPMnet outperformed these baseline methods in both the visual and quantitative assessments. Visually, the MSPMnet preserved structure, edges, and textures while strongly reducing noise and artifacts, generating the denoised images closest to normal-dose computed tomography images. Quantitatively, the MSPMnet had the lowest root mean-square error (RMSE) (8.3094 ± 1.9325) and the highest peak signal-to-noise ratio (PSNR) (33.8525 ± 1.8213 dB), structural similarity index (SSIM) (0.9309 ± 0.0272), and feature similarity index (FSIM) (0.9699 ± 0.0113).

Conclusions: The proposed MSPMnet excelled at LDCT denoising, effectively removing noise and artifacts while preserving edges. Compared to the state-of-the-art CNNs and Transformers, the proposed MSPMnet exhibited superior denoising performance both quantitatively and qualitatively.
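
The abstract states that each large two-dimensional kernel is decomposed into cascaded orthogonal one-dimensional kernels to stay lightweight. The sketch below illustrates the general idea only and is not the authors' MHDC implementation: it assumes a plain k x 1 pass followed by a 1 x k pass on a single channel, with no padding, dilation, or nonlinearity. The kernel size `k = 7`, the image size, and all function names are hypothetical choices made for illustration. It shows that such a cascade equals one pass with a rank-1 (outer-product) k x k kernel, while storing 2k weights per channel instead of k².

```python
import random


def conv2d_valid(image, kernel):
    """Plain 2-D cross-correlation with no padding ('valid' mode)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(
                image[i + a][j + b] * kernel[a][b]
                for a in range(kh)
                for b in range(kw)
            )
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def cascaded_1d(image, col, row):
    """Apply a k x 1 kernel, then a 1 x k kernel (two orthogonal 1-D passes)."""
    vertical = [[c] for c in col]  # shape k x 1
    horizontal = [list(row)]       # shape 1 x k
    return conv2d_valid(conv2d_valid(image, vertical), horizontal)


def weights_per_channel(k):
    """Weights for one depth-wise channel: full k x k vs. a k x 1 + 1 x k pair."""
    return k * k, 2 * k


random.seed(0)
k = 7  # hypothetical kernel size, not taken from the paper
image = [[random.random() for _ in range(12)] for _ in range(12)]
col = [random.random() for _ in range(k)]
row = [random.random() for _ in range(k)]

# The cascade matches a single 2-D pass with the outer-product kernel.
outer = [[c * r for r in row] for c in col]
full = conv2d_valid(image, outer)
cascade = cascaded_1d(image, col, row)
max_diff = max(
    abs(x - y) for fr, cr in zip(full, cascade) for x, y in zip(fr, cr)
)

print(weights_per_channel(k))  # (49, 14)
print(max_diff < 1e-9)         # True
```

The saving grows with kernel size, since the weight count scales linearly rather than quadratically in k. The trade-off is that a single cascade can only represent rank-1 kernels; how the published module compensates for this (for example, through its multi-scale branches) is not specified in the abstract.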
Pages: 9290-9305
Number of pages: 16
Related papers
38 in total
[1]   Ray Contribution Masks for Structure Adaptive Sinogram Filtering [J].
Balda, Michael ;
Hornegger, Joachim ;
Heismann, Bjoern .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2012, 31 (06) :1228-1239
[2]  
Bing Zhenshan, 2023, 11 INT C LEARN REPR
[3]   Cine Cone Beam CT Reconstruction Using Low-Rank Matrix Factorization: Algorithm and a Proof-of-Principle Study [J].
Cai, Jian-Feng ;
Jia, Xun ;
Gao, Hao ;
Jiang, Steve B. ;
Shen, Zuowei ;
Zhao, Hongkai .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2014, 33 (08) :1581-1591
[4]   Low-Dose CT With a Residual Encoder-Decoder Convolutional Neural Network [J].
Chen, Hu ;
Zhang, Yi ;
Kalra, Mannudeep K. ;
Lin, Feng ;
Chen, Yang ;
Liao, Peixi ;
Zhou, Jiliu ;
Wang, Ge .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (12) :2524-2535
[5]  
Cui Y., 2023, Proceedings of the international conference on machine learning, P6545
[6]  
Cui YN, 2024, AAAI CONF ARTIF INTE, P1426
[7]   Focal Network for Image Restoration [J].
Cui, Yuning ;
Ren, Wenqi ;
Cao, Xiaochun ;
Knoll, Alois .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :12955-12965
[8]  
Cui YN, 2023, PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, P645
[9]   Exploring the potential of channel interactions for image restoration [J].
Cui, Yuning ;
Knoll, Alois .
KNOWLEDGE-BASED SYSTEMS, 2023, 282
[10]   Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition [J].
Hou, Qibin ;
Lu, Cheng-Ze ;
Cheng, Ming-Ming ;
Feng, Jiashi .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) :8274-8283