DLFA-NET: A HYBRID NETWORK FOR MR-TO-CT SYNTHESIS VIA GLOBAL AND LOCAL FEATURE AGGREGATION

被引:2
作者
Chen, Zeli [1 ,2 ]
Li, Chuanpu [1 ,2 ]
Zheng, Kaiyi [1 ,2 ]
Zhang, Yiwen [1 ,2 ]
Wu, Yuankui [3 ]
Feng, Qianjin [1 ,2 ]
Zhong, Liming
Yang, Wei
机构
[1] Southern Med Univ, Sch Biomed Engn, Guangzhou, Peoples R China
[2] Southern Med Univ, Guangdong Prov Key Lab Med Image Proc, Guangzhou, Peoples R China
[3] Southern Med Univ, Nanfang Hosp, Dept Med Imaging Ctr, Guangzhou 510515, Peoples R China
来源
2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI | 2023年
基金
中国国家自然科学基金;
关键词
MR-to-CT synthesis; MR-only treatment planning; Transformer; Convolutional neural network;
D O I
10.1109/ISBI53787.2023.10230486
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Synthesis of Computed Tomography (CT) images from Magnetic Resonance (MR) images is clinical significance for MR-only treatment planning to eliminate the co-registration errors between MR and CT images. Existing convolutional neural network-based methods, suffering from the inherent local inductive biases, struggle to distinguish bone and air which show low signals in conventional MR images. ViT-based methods can learn long-range contextual information by a global self-attention mechanism but are limited by the quadratic complexity and generating local detailed structures. Combining the merits of these two architectures, we propose a hybrid network for MR-to-CT synthesis via global and local feature aggregation from Transformer and CNN, named GLFA-NET. Specifically, we add a global patch embedding branch to supplement the patch-based global representative features directly from the image and design a residual dilated swin transformer block aggregating the local detailed features and global features to improve the synthesis performance of bone and air and reduce computational overhead. Furthermore, we adopt a wavelet PatchGAN discriminator to enhance the high-frequency detailed information of the synthetic CT. Our GLFA-NET was implemented on a dataset with 154 pairs of 3D MR-CT head and neck images. Experiments show that our GLFA-NET achieves impressive performance with MAE of 71.12 +/- 10.87, SSIM of 0.771 +/- 0.028, and PSNR of 28.91 +/- 1.33. The visual synthetic CT results also show that the proposed GLFA-NET method achieves better discrimination of bone and air and higher structural similarity than other state-of-the-art methods.
引用
收藏
页数:5
相关论文
共 17 条
[1]   Improving generalization in MR-to-CT synthesis in radiotherapy by using an augmented cycle generative adversarial network with unpaired data [J].
Boni, Evin N. D. Brou ;
Klein, John ;
Gulyban, Akos ;
Reynaert, Nick ;
Pasquier, David .
MEDICAL PHYSICS, 2021, 48 (06) :3003-3010
[2]   MR to CT synthesis with multicenter data in the pelvic area using a conditional generative adversarial network [J].
Boni, Kevin N. D. Brou ;
Klein, John ;
Vanquin, Ludovic ;
Wagner, Antoine ;
Lacornerie, Thomas ;
Pasquier, David ;
Reynaert, Nick .
PHYSICS IN MEDICINE AND BIOLOGY, 2020, 65 (07)
[3]   Gated Context Aggregation Network for Image Dehazing and Deraining [J].
Chen, Dongdong ;
He, Mingming ;
Fan, Qingnan ;
Liao, Jing ;
Zhang, Liheng ;
Hou, Dongdong ;
Yuan, Lu ;
Hua, Gang .
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1375-1383
[4]  
Chen J., 2021, arXiv
[5]   ResViT: Residual Vision Transformers for Multimodal Medical Image Synthesis [J].
Dalmaz, Onat ;
Yurt, Mahmut ;
Cukur, Tolga .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (10) :2598-2614
[6]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[7]   Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images [J].
Hatamizadeh, Ali ;
Nath, Vishwesh ;
Tang, Yucheng ;
Yang, Dong ;
Roth, Holger R. ;
Xu, Daguang .
BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT I, 2022, 12962 :272-284
[8]   Image-to-Image Translation with Conditional Adversarial Networks [J].
Isola, Phillip ;
Zhu, Jun-Yan ;
Zhou, Tinghui ;
Efros, Alexei A. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5967-5976
[9]   Attribute-aware Face Aging with Wavelet-based Generative Adversarial Networks [J].
Liu, Yunfan ;
Li, Qi ;
Sun, Zhenan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11869-11878
[10]  
Nie Dong, 2017, Med Image Comput Comput Assist Interv, V10435, P417, DOI 10.1007/978-3-319-66179-7_48