Benchmarking Deep Learning Models for Tooth Structure Segmentation

被引:14
|
作者
Schneider, L. [1 ,2 ]
Arsiwala-Scheppach, L. [1 ,2 ]
Krois, J. [1 ,2 ]
Meyer-Lueckel, H. [3 ]
Bressem, K. K. [4 ,5 ]
Niehues, S. M. [4 ]
Schwendicke, F. [1 ,2 ]
机构
[1] Charite, Dept Oral Diagnost Digital Hlth & Hlth Serv Res, Assmannshauser Str 4 6, D-14197 Berlin, Germany
[2] ITU WHO Focus Grp Hlth, Top Grp Dent Diagnost & Digital Dent, Geneva, Switzerland
[3] Univ Bern, Zahnmed Kliniken Univ Bern, Dept Restorat Prevent & Pediat Dent, Bern, Switzerland
[4] Charite Univ Med Berlin, Radiol Klin, Berlin, Germany
[5] Charite Univ Med Berlin, Berlin Inst Hlth, Berlin, Germany
关键词
computer vision; artificial intelligence; segmentation; tooth structures; transfer learning; neural networks;
D O I
10.1177/00220345221100169
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
A wide range of deep learning (DL) architectures with varying depths are available, with developers usually choosing one or a few of them for their specific task in a nonsystematic way. Benchmarking (i.e., the systematic comparison of state-of-the art architectures on a specific task) may provide guidance in the model development process and may allow developers to make better decisions. However, comprehensive benchmarking has not been performed in dentistry yet. We aimed to benchmark a range of architecture designs for 1 specific, exemplary case: tooth structure segmentation on dental bitewing radiographs. We built 72 models for tooth structure (enamel, dentin, pulp, fillings, crowns) segmentation by combining 6 different DL network architectures (U-Net, U-Net++, Feature Pyramid Networks, LinkNet, Pyramid Scene Parsing Network, Mask Attention Network) with 12 encoders from 3 different encoder families (ResNet, VGG, DenseNet) of varying depth (e.g., VGG13, VGG16, VGG19). On each model design, 3 initialization strategies (ImageNet, CheXpert, random initialization) were applied, resulting overall into 216 trained models, which were trained up to 200 epochs with the Adam optimizer (learning rate = 0.0001) and a batch size of 32. Our data set consisted of 1,625 human-annotated dental bitewing radiographs. We used a 5-fold cross-validation scheme and quantified model performances primarily by the F1-score. Initialization with ImageNet or CheXpert weights significantly outperformed random initialization (P < 0.05). Deeper and more complex models did not necessarily perform better than less complex alternatives. VGG-based models were more robust across model configurations, while more complex models (e.g., from the ResNet family) achieved peak performances. In conclusion, initializing models with pretrained weights may be recommended when training models for dental radiographic analysis. Less complex model architectures may be competitive alternatives if computational resources and training time are restricting factors. Models developed and found superior on nondental data sets may not show this behavior for dental domain-specific tasks.
引用
收藏
页码:1343 / 1349
页数:7
相关论文
共 50 条
  • [1] Benchmarking Deep Learning Models for Instance Segmentation
    Jung, Sunguk
    Heo, Hyeonbeom
    Park, Sangheon
    Jung, Sung-Uk
    Lee, Kyungjae
    APPLIED SCIENCES-BASEL, 2022, 12 (17):
  • [2] Benchmarking Deep Learning Models on Point Cloud Segmentation
    Zoumpekas, Thanasis
    Molina, Guillem
    Salamo, Maria
    Puig, Anna
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2021, 339 : 335 - 344
  • [3] Accuracy and efficiency of automatic tooth segmentation in digital dental models using deep learning
    Joon Im
    Ju-Yeong Kim
    Hyung-Seog Yu
    Kee-Joon Lee
    Sung-Hwan Choi
    Ji-Hoi Kim
    Hee-Kap Ahn
    Jung-Yul Cha
    Scientific Reports, 12
  • [4] Accuracy and efficiency of automatic tooth segmentation in digital dental models using deep learning
    Im, Joon
    Kim, Ju-Yeong
    Yu, Hyung-Seog
    Lee, Kee-Joon
    Choi, Sung-Hwan
    Kim, Ji-Hoi
    Ahn, Hee-Kap
    Cha, Jung-Yul
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] Benchmarking deep learning models on large healthcare datasets
    Purushotham, Sanjay
    Meng, Chuizheng
    Che, Zhengping
    Liu, Yan
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 83 : 112 - 134
  • [6] Benchmarking Deep Learning Models for Classification of Book Covers
    Lucieri A.
    Sabir H.
    Siddiqui S.A.
    Rizvi S.T.R.
    Iwana B.K.
    Uchida S.
    Dengel A.
    Ahmed S.
    SN Computer Science, 2020, 1 (3)
  • [7] Automatic Segmentation of Internal Tooth Structure from CBCT Images Using Hierarchical Deep Learning
    Kim, SaeHyun
    Song, In-Seok
    Baek, Seung Jun
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 703 - 713
  • [8] Review of Deep Learning Models for Spine Segmentation
    Zhou, Neng
    Wen, Hairu
    Wang, Yi
    Liu, Yang
    Zhou, Longfei
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 498 - 507
  • [9] Benchmarking Inference Performance of Deep Learning Models on Analog Devices
    Fagbohungbe, Omobayode
    Qian, Lijun
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [10] Benchmarking Object Detection Deep Learning Models in Embedded Devices
    Cantero, David
    Esnaola-Gonzalez, Iker
    Miguel-Alonso, Jose
    Jauregi, Ekaitz
    SENSORS, 2022, 22 (11)