Automatic segmentation of esophageal gross tumor volume in 18F-FDG PET/CT images via GloD-LoATUNet

被引:6
作者
Yue, Yaoting [1 ,2 ]
Li, Nan [3 ]
Zhang, Gaobo [1 ]
Zhu, Zhibin [4 ]
Liu, Xin [5 ]
Song, Shaoli [3 ]
Ta, Dean [1 ,5 ]
机构
[1] Fudan Univ, Ctr Biomed Engn, Sch Informat Sci & Technol, Shanghai 200438, Peoples R China
[2] Fudan Univ, Human Phenome Inst, Shanghai 201203, Peoples R China
[3] Fudan Univ, Dept Nucl Med, Shanghai Canc Ctr, Shanghai 201321, Peoples R China
[4] Hexi Univ, Sch Phys & Electromech Engn, Zhangye 734000, Gansu, Peoples R China
[5] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
Esophageal gross tumor volume; Segmentation; Transformer; PET; CT; SEMANTIC SEGMENTATION; ATTENTION; TRANSFORMER; CT;
D O I
10.1016/j.cmpb.2022.107266
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and objective: For esophageal squamous cell carcinoma, radiotherapy is one of the primary treatments. During the planning before radiotherapy, the intractable task is to precisely delineate the esophageal gross tumor volume (GTV) on medical images. In current clinical practice, the manual delineation suffers from high intra- and inter-rater variability, while also exhausting the oncologists on a treadmill. There is an urgent demand for effective com puter-aided automatic segmentationmethods. To this end, we designed a novel deep network, dubbed as GloD-LoATUNet. Methods: GloD-LoATUNet follows the effective U-shape structure. On the contractile path, the global deformable dense attention transformer (GloDAT), local attention transformer (LoAT), and convolution blocks are integrated to model long-range dependencies and localized information. On the center bridge and the expanding path, convolution blocks are adopted to upsample the extracted representations for pixel-wise semantic prediction. Between the peer-to-peer counterparts, enhanced skip connections are built to compensate for the lost spatial information and dependencies. By exploiting complementary strengths of the GloDAT, LoAT, and convolution, GloD-LoATUNet has remarkable representation learning capabilities, performing well in the prediction of the small and variable esophageal GTV.Results: The proposed approach was validated in the clinical positron emission tomography/computed tomography (PET/CT) cohort. For 4 different data partitions, we report the Dice similarity coefficient (DSC), Hausdorff distance (HD), and Mean surface distance (MSD) as: 0.83 +/- 0.13, 4.88 +/- 9.16 mm , and 1.40 +/- 4.11 mm ; 0.84 +/- 0.12, 6.89 +/- 12.04 mm , and 1.18 +/- 3.02 mm ; 0.84 +/- 0.13, 3.89 +/- 7.64 mm , and 1.28 +/- 3.68 mm ; 0.86 +/- 0.09, 3.71 +/- 4.79 mm , and 0.90 +/- 0.37 mm ; respectively. The predicted contours present a desirable consistency with the ground truth.Conclusions: The inspiring results confirm the accuracy and generalizability of the proposed model, demonstrating the potential for automatic segmentation of esophageal GTV in clinical practice.(c) 2022 Published by Elsevier B.V.
引用
收藏
页数:10
相关论文
共 38 条
  • [1] [Anonymous], 2020, ARXIV
  • [2] Burnet Neil G, 2004, Cancer Imaging, V4, P153, DOI 10.1102/1470-7330.2004.0054
  • [3] A novel convolutional neural network for kidney ultrasound images segmentation
    Chen, Gongping
    Yin, Jingjing
    Dai, Yu
    Zhang, Jianxun
    Yin, Xiaotao
    Cui, Liang
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 218
  • [4] Chen J, 2021, arXiv
  • [5] U-Net Plus: Deep Semantic Segmentation for Esophagus and Esophageal Cancer in Computed Tomography Images
    Chen, Shuchao
    Yang, Han
    Fu, Jiawen
    Mei, Weijian
    Ren, Shuai
    Liu, Yifei
    Zhu, Zhihua
    Liu, Lizhi
    Li, Haojiang
    Chen, Hongbo
    [J]. IEEE ACCESS, 2019, 7 : 82867 - 82877
  • [6] TSU-net: Two-stage multi-scale cascade and multi-field fusion U-net for right ventricular segmentation
    Du, Xiuquan
    Xu, Xiaofei
    Liu, Heng
    Li, Shuo
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2021, 93 (93)
  • [7] Esophagus segmentation in CT via 3D fully convolutional neural network and random walk
    Fechter, Tobias
    Adebahr, Sonja
    Baltas, Dimos
    Ben Ayed, Ismail
    Desrosiers, Christian
    Dolz, Jose
    [J]. MEDICAL PHYSICS, 2017, 44 (12) : 6341 - 6352
  • [8] A deep learning based multiscale approach to segment the areas of interest in whole slide images
    Feng, Yanbo
    Hafiane, Adel
    Laurent, Helene
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2021, 90
  • [9] Hao Z, 2017, P CHIN INT SYST C MU, P413
  • [10] Hatamizadeh A., 2021, P IEEE WINTER C APPL, P1