A dataset of laryngeal endoscopic images with comparative study on convolution neural network-based semantic segmentation

Cited by: 65
Authors
Laves, Max-Heinrich [1]
Bicker, Jens [1]
Kahrs, Lueder A. [1]
Ortmaier, Tobias [1]
Affiliation
[1] Leibniz Univ Hannover, Appelstr 11A, D-30167 Hannover, Germany
Keywords
Computer vision; Larynx; Vocal folds; Soft tissue; Open-access dataset; Machine learning; Patient-to-patient fine-tuning; SOFT-TISSUE MOTION; CLASSIFICATION; TRACKING;
DOI
10.1007/s11548-018-01910-0
Chinese Library Classification number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Purpose: Automated segmentation of anatomical structures in medical image analysis is a prerequisite for autonomous diagnosis as well as various computer- and robot-aided interventions. Recent methods based on deep convolutional neural networks (CNN) have outperformed former heuristic methods. However, those methods were primarily evaluated on rigid, real-world environments. In this study, existing segmentation methods were evaluated for their use on a new dataset of transoral endoscopic exploration.
Methods: Four machine learning-based methods (SegNet, UNet, ENet and ErfNet) were trained with supervision on a novel 7-class dataset of the human larynx. The dataset contains 536 manually segmented images from two patients during laser incisions. The Intersection-over-Union (IoU) evaluation metric was used to measure the accuracy of each method. Data augmentation and network ensembling were employed to increase segmentation accuracy. Stochastic inference was used to show uncertainties of the individual models. Patient-to-patient transfer was investigated using patient-specific fine-tuning.
Results: In this study, a weighted average ensemble network of UNet and ErfNet was best suited for the segmentation of laryngeal soft tissue with a mean IoU of 84.7%. The highest efficiency was achieved by ENet with a mean inference time of 9.22 ms per image. It is shown that 10 additional images from a new patient are sufficient for patient-specific fine-tuning.
Conclusion: CNN-based methods for semantic segmentation are applicable to endoscopic images of laryngeal soft tissue. The segmentation can be used for active constraints or to monitor morphological changes and autonomously detect pathologies. Further improvements could be achieved by using a larger dataset or training the models in a self-supervised manner on additional unlabeled data.
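The abstract names two techniques that a short sketch can make concrete: the mean IoU metric and a weighted average ensemble of two models. The Python below is only an illustration under stated assumptions, not the authors' implementation. The function names `mean_iou` and `weighted_ensemble`, the flattened per-pixel data layout, the choice to skip classes absent from both prediction and ground truth, and the weight of 0.5 are all hypothetical; the abstract does not state how these details were handled.

```python
from typing import List, Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean Intersection-over-Union over flattened per-pixel class labels.

    Assumption: classes that appear in neither pred nor target are skipped.
    """
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


def weighted_ensemble(
    probs_a: Sequence[Sequence[float]],
    probs_b: Sequence[Sequence[float]],
    weight_a: float = 0.5,  # hypothetical weight, not taken from the paper
) -> List[int]:
    """Average two models' per-pixel class probabilities, then take the argmax."""
    labels = []
    for pa, pb in zip(probs_a, probs_b):
        mixed = [weight_a * a + (1.0 - weight_a) * b for a, b in zip(pa, pb)]
        labels.append(max(range(len(mixed)), key=mixed.__getitem__))
    return labels


# Toy example: three pixels, two classes, two models (made-up numbers).
probs_a = [[0.9, 0.1], [0.4, 0.6], [0.2, 0.8]]
probs_b = [[0.6, 0.4], [0.7, 0.3], [0.1, 0.9]]
labels = weighted_ensemble(probs_a, probs_b, weight_a=0.5)
print(labels, mean_iou(labels, [0, 1, 1], num_classes=2))
```

In practice the same computation would typically run on array outputs of the networks (one probability vector per pixel, 7 classes for this dataset), with the ensemble weight chosen on validation data.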
Pages: 483-492
Number of pages: 10
Related papers
50 in total
  • [21] Convolutional Neural Network-Based Remote Sensing Images Segmentation Method for Extracting Winter Wheat Spatial Distribution
    Zhang, Chengming
    Gao, Shuai
    Yang, Xiaoxia
    Li, Feng
    Yue, Maorui
    Han, Yingjuan
    Zhao, Hui
    Zhang, Ya'nan
    Fan, Keqi
    APPLIED SCIENCES-BASEL, 2018, 8 (10):
  • [22] Retinal Vessel Segmentation Using Densely Connected Convolution Neural Network with Colorful Fundus Images
    Liu, Ze-Fan
    Zhang, Yu-Zhao
    Liu, Pei-Zhong
    Zhang, Yong
    Luo, Yan-Min
    Du, Yong-Zhao
    Peng, Yan
    Li, Ping
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2018, 8 (06) : 1300 - 1307
  • [23] Learnable Gated Convolutional Neural Network for Semantic Segmentation in Remote-Sensing Images
    Guo, Shichen
    Jin, Qizhao
    Wang, Hongzhen
    Wang, Xuezhi
    Wang, Yangang
    Xiang, Shiming
    REMOTE SENSING, 2019, 11 (16)
  • [24] Automated semantic lung segmentation in chest CT images using deep neural network
    Murugappan, M.
    Bourisly, Ali K. K.
    Prakash, N. B.
    Sumithra, M. G.
    Acharya, U. Rajendra
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (21) : 15343 - 15364
  • [25] Images data practices for Semantic Segmentation of Breast Cancer using Deep Neural Network
    Ahmed, Luqman
    Iqbal, Muhammad Munwar
    Aldabbas, Hamza
    Khalid, Shehzad
    Saleem, Yasir
    Saeed, Saqib
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 14 (11) : 15227 - 15243
  • [26] Application of neural network algorithms for semantic segmentation of satellite images of the Earth's surface
    Druki, Alexey A.
    Spitsyn, Vladimir G.
    VESTNIK TOMSKOGO GOSUDARSTVENNOGO UNIVERSITETA-UPRAVLENIE VYCHISLITELNAJA TEHNIKA I INFORMATIKA-TOMSK STATE UNIVERSITY JOURNAL OF CONTROL AND COMPUTER SCIENCE, 2023, (63): : 62 - 71
  • [27] Exploring uncertainty measures in convolutional neural network for semantic segmentation of oral cancer images
    Song, Bofan
    Li, Shaobai
    Sunny, Sumsum
    Gurushanth, Keerthi
    Mendonca, Pramila
    Mukhia, Nirza
    Patrick, Sanjana
    Peterson, Tyler
    Gurudath, Shubha
    Raghavan, Subhashini
    Tsusennaro, Imchen
    Leivon, Shirley T.
    Kolur, Trupti
    Shetty, Vivek
    Bushan, Vidya
    Ramesh, Rohan
    Pillai, Vijay
    Wilder-Smith, Petra
    Suresh, Amritha
    Kuriakose, Moni Abraham
    Birur, Praveen
    Liang, Rongguang
    JOURNAL OF BIOMEDICAL OPTICS, 2022, 27 (11)
  • [28] Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene Images
    Wang, Libo
    Li, Rui
    Wang, Dongzhi
    Duan, Chenxi
    Wang, Teng
    Meng, Xiaoliang
    REMOTE SENSING, 2021, 13 (16)
  • [29] DCNNBT: A NOVEL DEEP CONVOLUTION NEURAL NETWORK-BASED BRAIN TUMOR CLASSIFICATION MODEL
    Haq, Mohd Anul
    Khan, Ilyas
    Ahmed, Ahsan
    Eldin, Sayed M.
    Alshehri, Ali
    Ghamry, Nivin A.
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2023, 31 (06)
  • [30] A lightweight convolutional neural network-based feature extractor for visible images
    He, Xujie
    Jin, Jing
    Jiang, Yu
    Li, Dandan
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249