Knowledge distillation on individual vertebrae segmentation exploiting 3D U-Net

Cited by: 8
Authors
Serrador, Luis [1, 2]
Villani, Francesca Pia [3]
Moccia, Sara [4, 5]
Santos, Cristina P. [1, 2]
Affiliations
[1] Univ Minho, Ctr MicroElectroMechan Syst CMEMS, Guimaraes, Portugal
[2] Hosp Braga, Clin Acad Ctr Braga 2CA Braga, Braga, Portugal
[3] Univ Macerata, Dept Humanities, Macerata, Italy
[4] Scuola Super Sant Anna, BioRobot Inst, Pisa, Italy
[5] Scuola Super Sant Anna, Dept Excellence Robot & AI, Pisa, Italy
Keywords
Vertebra segmentation; 3D U-net; Knowledge distillation; Computed tomography
DOI
10.1016/j.compmedimag.2024.102350
Chinese Library Classification
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Recent advances in medical imaging have highlighted the critical development of algorithms for individual vertebral segmentation on computed tomography (CT) scans. Essential for diagnostic accuracy and treatment planning in orthopaedics, neurosurgery and oncology, these algorithms face challenges in clinical implementation, including integration into healthcare systems. Consequently, our focus lies in exploring the application of knowledge distillation (KD) methods to train shallower networks capable of efficiently segmenting vertebrae in CT scans. This approach aims to reduce segmentation time, enhance suitability for emergency cases, and optimize computational and memory resource efficiency. Building upon prior research in the field, a two-step segmentation approach was employed. Firstly, the spine's location was determined by predicting a heatmap, indicating the probability of each voxel belonging to the spine. Subsequently, an iterative segmentation of vertebrae was performed from the top to the bottom of the CT volume over the located spine, using a memory instance to record the already segmented vertebrae. KD methods were implemented by training a teacher network with performance similar to that found in the literature, and this knowledge was distilled to a shallower network (student). Two KD methods were applied: (1) using the soft outputs of both networks and (2) matching logits. Two publicly available datasets, comprising 319 CT scans from 300 patients and a total of 611 cervical, 2387 thoracic, and 1507 lumbar vertebrae, were used. To ensure dataset balance and robustness, effective data augmentation methods were applied, including cleaning the memory instance to replicate the first vertebra segmentation. The teacher network achieved an average Dice similarity coefficient (DSC) of 88.22% and a Hausdorff distance (HD) of 7.71 mm, showcasing performance similar to other approaches in the literature. 
Through knowledge distillation from the teacher network, the student network's performance improved, with an average DSC increasing from 75.78% to 84.70% and an HD decreasing from 15.17 mm to 8.08 mm. Compared to other methods, our teacher network exhibited up to 99.09% fewer parameters, 90.02% faster inference time, 88.46% shorter total segmentation time, and 89.36% less associated carbon (CO2) emission rate. Regarding our student network, it featured 75.00% fewer parameters than our teacher, resulting in a 36.15% reduction in inference time, a 33.33% decrease in total segmentation time, and a 42.96% reduction in CO2 emissions. This study marks the first exploration of applying KD to the problem of individual vertebrae segmentation in CT, demonstrating the feasibility of achieving comparable performance to existing methods using smaller neural networks.
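The abstract names two KD variants, one based on the soft outputs of both networks and one based on matching logits, but does not give the loss definitions. The sketch below is a minimal illustration under assumptions of our own: the soft-output loss is taken to be a temperature-scaled KL divergence between teacher and student probabilities (a common formulation), and the logit-matching loss is taken to be a mean squared error on raw logits. The function names, the temperature value, and the three-class example vectors are hypothetical and are not taken from the paper, which operates on per-voxel 3D U-Net outputs.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits, softened by a temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def soft_output_loss(student_logits, teacher_logits, temperature=2.0):
    """Assumed soft-output KD loss: KL(teacher || student) on softened probabilities.

    The T^2 factor is the usual rescaling that keeps gradient magnitudes
    comparable across temperatures; it is an assumption, not the paper's choice.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)
    return temperature ** 2 * kl


def logit_matching_loss(student_logits, teacher_logits):
    """Assumed logit-matching KD loss: mean squared error between raw logits."""
    n = len(teacher_logits)
    return sum((s - t) ** 2 for s, t in zip(student_logits, teacher_logits)) / n


# Hypothetical logits for a single voxel (illustrative values only).
teacher = [4.0, 1.0, -2.0]
student = [2.5, 1.5, -1.0]

print("soft-output loss:", soft_output_loss(student, teacher))
print("logit-matching loss:", logit_matching_loss(student, teacher))
```

In a training loop, either term would typically be averaged over all voxels and combined with the ordinary segmentation loss against the ground-truth mask; how the two are weighted here is not stated in the abstract.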
Pages: 11