A dataset of laryngeal endoscopic images with comparative study on convolution neural network-based semantic segmentation

被引:65
作者
Laves, Max-Heinrich [1 ]
Bicker, Jens [1 ]
Kahrs, Lueder A. [1 ]
Ortmaier, Tobias [1 ]
机构
[1] Leibniz Univ Hannover, Appelstr 11A, D-30167 Hannover, Germany
关键词
Computer vision; Larynx; Vocal folds; Soft tissue; Open-access dataset; Machine learning; Patient-to-patient fine-tuning; SOFT-TISSUE MOTION; CLASSIFICATION; TRACKING;
D O I
10.1007/s11548-018-01910-0
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
PurposeAutomated segmentation of anatomical structures in medical image analysis is a prerequisite for autonomous diagnosis as well as various computer- and robot-aided interventions. Recent methods based on deep convolutional neural networks (CNN) have outperformed former heuristic methods. However, those methods were primarily evaluated on rigid, real-world environments. In this study, existing segmentation methods were evaluated for their use on a new dataset of transoral endoscopic exploration.MethodsFour machine learning-based methods SegNet, UNet, ENet and ErfNet were trained with supervision on a novel 7-class dataset of the human larynx. The dataset contains 536 manually segmented images from two patients during laser incisions. The Intersection-over-Union (IoU) evaluation metric was used to measure the accuracy of each method. Data augmentation and network ensembling were employed to increase segmentation accuracy. Stochastic inference was used to show uncertainties of the individual models. Patient-to-patient transfer was investigated using patient-specific fine-tuning.ResultsIn this study, a weighted average ensemble network of UNet and ErfNet was best suited for the segmentation of laryngeal soft tissue with a mean IoU of 84.7%. The highest efficiency was achieved by ENet with a mean inference time of 9.22ms per image. It is shown that 10 additional images from a new patient are sufficient for patient-specific fine-tuning.ConclusionCNN-based methods for semantic segmentation are applicable to endoscopic images of laryngeal soft tissue. The segmentation can be used for active constraints or to monitor morphological changes and autonomously detect pathologies. Further improvements could be achieved by using a larger dataset or training the models in a self-supervised manner on additional unlabeled data.
引用
收藏
页码:483 / 492
页数:10
相关论文
共 50 条
  • [11] Intelligent weight prediction of cows based on semantic segmentation and back propagation neural network
    Xu, Beibei
    Mao, Yifan
    Wang, Wensheng
    Chen, Guipeng
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [12] Automatic glottis segmentation for laryngeal endoscopic images based on U-Net
    Ding, Huijun
    Cen, Qian
    Si, Xiaoyu
    Pan, Zhanpeng
    Chen, Xiangdong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 71
  • [13] Combined Transfer Learning and Test-Time Augmentation Improves Convolutional Neural Network-Based Semantic Segmentation of Prostate Cancer from Multi-Parametric MR Images
    Hoar, David
    Lee, Peter Q.
    Guida, Alessandro
    Patterson, Steven
    Bowen, Chris, V
    Merrimen, Jennifer
    Wang, Cheng
    Rendon, Ricardo
    Beyea, Steven D.
    Clarke, Sharon E.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 210 (210)
  • [14] Comparative Study of Neural Network- Based Approaches for QRS Segmentation
    Kolokolnikov, George
    Borde, Anna
    Skuratov, Victor
    Gaponov, Roman
    Rumyantseva, Anastasiya
    INTERNATIONAL JOURNAL OF EMBEDDED AND REAL-TIME COMMUNICATION SYSTEMS (IJERTCS), 2020, 11 (04): : 80 - 103
  • [15] A convolutional neural network model for semantic segmentation of mitotic events in microscopy images
    Orturk, Saban
    Akdemir, Bayram
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08) : 3719 - 3728
  • [16] Classification of endoscopic images based on texture and neural network
    Wang, P
    Krishnan, SM
    Kugean, C
    Tjoa, MP
    PROCEEDINGS OF THE 23RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: BUILDING NEW BRIDGES AT THE FRONTIERS OF ENGINEERING AND MEDICINE, 2001, 23 : 3691 - 3695
  • [17] Gated Convolutional Neural Network for Semantic Segmentation in High-Resolution Images
    Wang, Hongzhen
    Wang, Ying
    Zhang, Qian
    Xiang, Shiming
    Pan, Chunhong
    REMOTE SENSING, 2017, 9 (05)
  • [18] Semantic Segmentation of Remote Sensing Image Based on Neural Network
    Wang Ende
    Qi Kai
    Li Xuepeng
    Peng Liangyu
    ACTA OPTICA SINICA, 2019, 39 (12)
  • [19] The application of convolution neural network based cell segmentation during cryopreservation
    Mbogba, Momoh Karmah
    Haider, Zeeshan
    Hossain, S. M. Chapal
    Huang, Daobin
    Memon, Kashan
    Panhwar, Fazil
    Lei, Zeling
    Zhao, Gang
    CRYOBIOLOGY, 2018, 85 : 95 - 104
  • [20] Deep Convolution Neural Network-Based Crack Feature Extraction, Detection and Quantification
    Teng, Shuai
    Chen, Gongfa
    JOURNAL OF FAILURE ANALYSIS AND PREVENTION, 2022, 22 (03) : 1308 - 1321