DLFA-NET: A HYBRID NETWORK FOR MR-TO-CT SYNTHESIS VIA GLOBAL AND LOCAL FEATURE AGGREGATION

被引：2

作者：

Chen, Zeli ^{[1
,2
]}

Li, Chuanpu ^{[1
,2
]}

Zheng, Kaiyi ^{[1
,2
]}

Zhang, Yiwen ^{[1
,2
]}

Wu, Yuankui ^{[3
]}

Feng, Qianjin ^{[1
,2
]}

Zhong, Liming

Yang, Wei

机构：

[1] Southern Med Univ, Sch Biomed Engn, Guangzhou, Peoples R China

[2] Southern Med Univ, Guangdong Prov Key Lab Med Image Proc, Guangzhou, Peoples R China

[3] Southern Med Univ, Nanfang Hosp, Dept Med Imaging Ctr, Guangzhou 510515, Peoples R China

来源：

2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI | 2023年

基金：

中国国家自然科学基金;

关键词：

MR-to-CT synthesis; MR-only treatment planning; Transformer; Convolutional neural network;

D O I：

10.1109/ISBI53787.2023.10230486

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Synthesis of Computed Tomography (CT) images from Magnetic Resonance (MR) images is clinical significance for MR-only treatment planning to eliminate the co-registration errors between MR and CT images. Existing convolutional neural network-based methods, suffering from the inherent local inductive biases, struggle to distinguish bone and air which show low signals in conventional MR images. ViT-based methods can learn long-range contextual information by a global self-attention mechanism but are limited by the quadratic complexity and generating local detailed structures. Combining the merits of these two architectures, we propose a hybrid network for MR-to-CT synthesis via global and local feature aggregation from Transformer and CNN, named GLFA-NET. Specifically, we add a global patch embedding branch to supplement the patch-based global representative features directly from the image and design a residual dilated swin transformer block aggregating the local detailed features and global features to improve the synthesis performance of bone and air and reduce computational overhead. Furthermore, we adopt a wavelet PatchGAN discriminator to enhance the high-frequency detailed information of the synthetic CT. Our GLFA-NET was implemented on a dataset with 154 pairs of 3D MR-CT head and neck images. Experiments show that our GLFA-NET achieves impressive performance with MAE of 71.12 +/- 10.87, SSIM of 0.771 +/- 0.028, and PSNR of 28.91 +/- 1.33. The visual synthetic CT results also show that the proposed GLFA-NET method achieves better discrimination of bone and air and higher structural similarity than other state-of-the-art methods.

引用

页数：5

共 17 条

[1] Improving generalization in MR-to-CT synthesis in radiotherapy by using an augmented cycle generative adversarial network with unpaired data [J].

Boni, Evin N. D. Brou ;

Klein, John ;

Gulyban, Akos ;

Reynaert, Nick ;

Pasquier, David .

MEDICAL PHYSICS, 2021, 48 (06) :3003-3010

[2] MR to CT synthesis with multicenter data in the pelvic area using a conditional generative adversarial network [J].

Boni, Kevin N. D. Brou ;

Klein, John ;

Vanquin, Ludovic ;

Wagner, Antoine ;

Lacornerie, Thomas ;

Pasquier, David ;

Reynaert, Nick .

PHYSICS IN MEDICINE AND BIOLOGY, 2020, 65 (07)

[3] Gated Context Aggregation Network for Image Dehazing and Deraining [J].

Chen, Dongdong ;

He, Mingming ;

Fan, Qingnan ;

Liao, Jing ;

Zhang, Liheng ;

Hou, Dongdong ;

Yuan, Lu ;

Hua, Gang .

2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1375-1383

[4]

Chen J., 2021, arXiv

[5] ResViT: Residual Vision Transformers for Multimodal Medical Image Synthesis [J].

Dalmaz, Onat ;

Yurt, Mahmut ;

Cukur, Tolga .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (10) :2598-2614

[6]

Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

[7] Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images [J].

Hatamizadeh, Ali ;

Nath, Vishwesh ;

Tang, Yucheng ;

Yang, Dong ;

Roth, Holger R. ;

Xu, Daguang .

BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT I, 2022, 12962 :272-284

[8] Image-to-Image Translation with Conditional Adversarial Networks [J].

Isola, Phillip ;

Zhu, Jun-Yan ;

Zhou, Tinghui ;

Efros, Alexei A. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5967-5976

[9] Attribute-aware Face Aging with Wavelet-based Generative Adversarial Networks [J].

Liu, Yunfan ;

Li, Qi ;

Sun, Zhenan .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11869-11878

[10]

Nie Dong, 2017, Med Image Comput Comput Assist Interv, V10435, P417, DOI 10.1007/978-3-319-66179-7_48

← 1 2 →