HCformer: Hybrid CNN-Transformer for LDCT Image Denoising

被引：28

作者：

Yuan, Jinli ^{[1
]}

Zhou, Feng ^{[1
]}

Guo, Zhitao ^{[1
]}

Li, Xiaozeng ^{[1
]}

Yu, Hengyong ^{[2
]}

机构：

[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China

[2] Univ Massachusetts Lowell, Dept Elect & Comp Engn, Lowell, MA 01854 USA

来源：

JOURNAL OF DIGITAL IMAGING | 2023年 / 36卷 / 05期

关键词：

Low-dose CT; Deep learning; CT image denoising; LOW-DOSE CT; NETWORK;

D O I：

10.1007/s10278-023-00842-9

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

Low-dose computed tomography (LDCT) is an effective way to reduce radiation exposure for patients. However, it will increase the noise of reconstructed CT images and affect the precision of clinical diagnosis. The majority of the current deep learning-based denoising methods are built on convolutional neural networks (CNNs), which concentrate on local information and have little capacity for multiple structures modeling. Transformer structures are capable of computing each pixel's response on a global scale, but their extensive computation requirements prevent them from being widely used in medical image processing. To reduce the impact of LDCT scans on patients, this paper aims to develop an image post-processing method by combining CNN and Transformer structures. This method can obtain a high-quality images from LDCT. A hybrid CNN-Transformer (HCformer) codec network model is proposed for LDCT image denoising. A neighborhood feature enhancement (NEF) module is designed to introduce the local information into the Transformer's operation, and the representation of adjacent pixel information in the LDCT image denoising task is increased. The shifting window method is utilized to lower the computational complexity of the network model and overcome the problems that come with computing the MSA (Multi-head self-attention) process in a fixed window. Meanwhile, W/SW-MSA (Windows/Shifted window Multi-head self-attention) is alternately used in two layers of the Transformer to gain the information interaction between various Transformer layers. This approach can successfully decrease the Transformer's overall computational cost. The AAPM 2016 LDCT grand challenge dataset is employed for ablation and comparison experiments to demonstrate the viability of the proposed LDCT denoising method. Per the experimental findings, HCformer can increase the image quality metrics SSIM, HuRMSE and FSIM from 0.8017, 34.1898, and 0.6885 to 0.8507, 17.7213, and 0.7247, respectively. Additionally, the proposed HCformer algorithm will preserves image details while it reduces noise. In this paper, an HCformer structure is proposed based on deep learning and evaluated by using the AAPM LDCT dataset. Both the qualitative and quantitative comparison results confirm that the proposed HCformer outperforms other methods. The contribution of each component of the HCformer is also confirmed by the ablation experiments. HCformer can combine the advantages of CNN and Transformer, and it has great potential for LDCT image denoising and other tasks.

引用

页码：2290 / 2305

页数：16

共 40 条

[1] Current concepts - Computed tomography - An increasing source of radiation exposure [J].

Brenner, David J. ;

Hall, Eric J. .

NEW ENGLAND JOURNAL OF MEDICINE, 2007, 357 (22) :2277-2284

[2] A non-local algorithm for image denoising [J].

Buades, A ;

Coll, B ;

Morel, JM .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :60-65

[3] End-to-End Object Detection with Transformers [J].

Carion, Nicolas ;

Massa, Francisco ;

Synnaeve, Gabriel ;

Usunier, Nicolas ;

Kirillov, Alexander ;

Zagoruyko, Sergey .

COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229

[4] Pre-Trained Image Processing Transformer [J].

Chen, Hanting ;

Wang, Yunhe ;

Guo, Tianyu ;

Xu, Chang ;

Deng, Yiping ;

Liu, Zhenhua ;

Ma, Siwei ;

Xu, Chunjing ;

Xu, Chao ;

Gao, Wen .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305

[5] Low-Dose CT With a Residual Encoder-Decoder Convolutional Neural Network [J].

Chen, Hu ;

Zhang, Yi ;

Kalra, Mannudeep K. ;

Lin, Feng ;

Chen, Yang ;

Liao, Peixi ;

Zhou, Jiliu ;

Wang, Ge .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (12) :2524-2535

[6] Low-dose CT via convolutional neural network [J].

Chen, Hu ;

Zhang, Yi ;

Zhang, Weihua ;

Liao, Peixi ;

Li, Ke ;

Zhou, Jiliu ;

Wang, Ge .

BIOMEDICAL OPTICS EXPRESS, 2017, 8 (02) :679-694

[7]

Chen Jieneng, 2021, arXiv

[8]

Cordonnier J.-B., 2019, ARXIV

[9]

Dosovitskiy A., 2021, arXiv

[10]

Elsayed G., 2020, INT C MACHINE LEARNI, P2868, DOI DOI 10.48550/ARXIV.2002.02959

← 1 2 3 4 →