Deep learning in computed tomography super resolution using multi-modality data training

被引:3
作者
Fok, Wai Yan Ryana [1 ,2 ,5 ]
Fieselmann, Andreas [1 ]
Herbst, Magdalena [1 ]
Ritschl, Ludwig [1 ]
Kappler, Steffen [1 ]
Saalfeld, Sylvia [3 ,4 ]
机构
[1] Siemens Healthcare GmbH, Xray Prod, Forchheim, Germany
[2] Otto von Guericke Univ, Fac Comp Sci, Magdeburg, Germany
[3] Ilmenau Univ Technol, Computat Med Grp, Ilmenau, Germany
[4] Otto von Guericke Univ, Res Campus STIMULATE, Magdeburg, Germany
[5] Siemensstr 3, D-91301 Forchheim, Germany
关键词
cone-beam computed tomography; deep learning; multimodality; super resolution; MODULATION TRANSFER-FUNCTION; IMAGE; SUPERRESOLUTION; CT; KERNEL;
D O I
10.1002/mp.16825
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
BackgroundOne of the limitations in leveraging the potential of artificial intelligence in X-ray imaging is the limited availability of annotated training data. As X-ray and CT shares similar imaging physics, one could achieve cross-domain data sharing, so to generate labeled synthetic X-ray images from annotated CT volumes as digitally reconstructed radiographs (DRRs). To account for the lower resolution of CT and the CT-generated DRRs as compared to the real X-ray images, we propose the use of super-resolution (SR) techniques to enhance the CT resolution before DRR generation.PurposeAs spatial resolution can be defined by the modulation transfer function kernel in CT physics, we propose to train a SR network using paired low-resolution (LR) and high-resolution (HR) images by varying the kernel's shape and cutoff frequency. This is different to previous deep learning-based SR techniques on RGB and medical images which focused on refining the sampling grid. Instead of generating LR images by bicubic interpolation, we aim to create realistic multi-detector CT (MDCT) like LR images from HR cone-beam CT (CBCT) scans.MethodsWe propose and evaluate the use of a SR U-Net for the mapping between LR and HR CBCT image slices. We reconstructed paired LR and HR training volumes from the same CT scans with small in-plane sampling grid size of '(Res). We used the residual U-Net architecture to train two models. SRUN (k)(Res ): trained with kernel-based LR images, and SRUN'(Res): trained with bicubic downsampled data as baseline. Both models are trained on one CBCT dataset (n = 13 391). The performance of both models was then evaluated on unseen kernel-based and interpolation-based LR CBCT images (n = 10 950), and also on MDCT images (n = 1392).ResultsFive-fold cross validation and ablation study were performed to find the optimal hyperparameters. Both SRUNResk and SRUN'(Res )models show significant improvements (p-value < 0.05) in mean absolute error (MAE), peak signal-to-noise ratio (PSNR) and structural similarity index measures (SSIMs) on unseen CBCT images. Also, the improvement percentages in MAE, PSNR, and SSIM by SRUN (k)(Res) is larger than SRUN '(Res). For SRUN (k)(Res), MAE is reduced by 14%, and PSNR and SSIMs increased by 6 and 8%, respectively. To conclude, SRUN (k)(Res) outperforms SRUN'(Res), which the former generates sharper images when tested with kernel-based LR CBCT images as well as cross-modality LR MDCT data.ConclusionsOur proposed method showed better performance than the baseline interpolation approach on unseen LR CBCT. We showed that the frequency behavior of the used data is important for learning the SR features. Additionally, we showed cross-modality resolution improvements to LR MDCT images. Our approach is, therefore, a first and essential step in enabling realistic high spatial resolution CT-generated DRRs for deep learning training.
引用
收藏
页码:2846 / 2860
页数:15
相关论文
共 53 条
[1]   The Medical Segmentation Decathlon [J].
Antonelli, Michela ;
Reinke, Annika ;
Bakas, Spyridon ;
Farahani, Keyvan ;
Kopp-Schneider, Annette ;
Landman, Bennett A. ;
Litjens, Geert ;
Menze, Bjoern ;
Ronneberger, Olaf ;
Summers, Ronald M. ;
van Ginneken, Bram ;
Bilello, Michel ;
Bilic, Patrick ;
Christ, Patrick F. ;
Do, Richard K. G. ;
Gollub, Marc J. ;
Heckers, Stephan H. ;
Huisman, Henkjan ;
Jarnagin, William R. ;
McHugo, Maureen K. ;
Napel, Sandy ;
Pernicka, Jennifer S. Golia ;
Rhode, Kawal ;
Tobon-Gomez, Catalina ;
Vorontsov, Eugene ;
Meakin, James A. ;
Ourselin, Sebastien ;
Wiesenfarth, Manuel ;
Arbelaez, Pablo ;
Bae, Byeonguk ;
Chen, Sihong ;
Daza, Laura ;
Feng, Jianjiang ;
He, Baochun ;
Isensee, Fabian ;
Ji, Yuanfeng ;
Jia, Fucang ;
Kim, Ildoo ;
Maier-Hein, Klaus ;
Merhof, Dorit ;
Pai, Akshay ;
Park, Beomhee ;
Perslev, Mathias ;
Rezaiifar, Ramin ;
Rippel, Oliver ;
Sarasua, Ignacio ;
Shen, Wei ;
Son, Jaemin ;
Wachinger, Christian ;
Wang, Liansheng .
NATURE COMMUNICATIONS, 2022, 13 (01)
[2]   Photon-counting Detector CT with Deep Learning Noise Reduction to Detect Multiple Myeloma [J].
Baffour, Francis I. ;
Huber, Nathan R. ;
Ferrero, Andrea ;
Rajendran, Kishore ;
Glazebrook, Katrina N. ;
Larson, Nicholas B. ;
Kumar, Shaji ;
Cook, Joselle M. ;
Leng, Shuai ;
Shanblatt, Elisabeth R. ;
McCollough, Cynthia H. ;
Fletcher, Joel G. .
RADIOLOGY, 2023, 306 (01) :229-236
[3]   Automated Detection and Quantification of COVID-19 Airspace Disease on Chest Radiographs A Novel Approach Achieving Expert Radiologist-Level Performance Using a Deep Convolutional Neural Network Trained on Digital Reconstructed Radiographs From Computed Tomography-Derived Ground Truth [J].
Barbosa Jr, Eduardo J. Mortani ;
Gefter, Warren B. ;
Ghesu, Florin C. ;
Liu, Siqi ;
Mailhe, Boris ;
Mansoor, Awais ;
Grbic, Sasa ;
Vogt, Sebastian .
INVESTIGATIVE RADIOLOGY, 2021, 56 (08) :471-479
[4]  
Bell-Kligler S, 2019, ADV NEUR IN, V32
[5]  
Bhandary M., 2022, ARXIV
[6]   Determination of the presampled MTF in computed tomography [J].
Boone, JM .
MEDICAL PHYSICS, 2001, 28 (03) :356-360
[7]   Image Super-Resolution Using Deep Convolutional Networks [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307
[8]   Effect of 10% formalin on radiographic optical density of bone specimens [J].
Fonseca, A. A. ;
Cherubini, K. ;
Veeck, E. B. ;
Ladeira, R. S. ;
Carapeto, L. P. .
DENTOMAXILLOFACIAL RADIOLOGY, 2008, 37 (03) :137-141
[9]   A simple approach to measure computed tomography (CT) modulation transfer function (MTF) and noise-power spectrum (NPS) using the American College of Radiology (ACR) accreditation phantom [J].
Friedman, Saul N. ;
Fung, George S. K. ;
Siewerdsen, Jeffrey H. ;
Tsui, Benjamin M. W. .
MEDICAL PHYSICS, 2013, 40 (05)
[10]   Frequency Separation for Real-World Super-Resolution [J].
Fritsche, Manuel ;
Gu, Shuhang ;
Timofte, Radu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :3599-3608