A dataset of laryngeal endoscopic images with comparative study on convolution neural network-based semantic segmentation

Cited by: 65

Authors:

Laves, Max-Heinrich [1]

Bicker, Jens [1]

Kahrs, Lueder A. [1]

Ortmaier, Tobias [1]

Affiliations:

[1] Leibniz Univ Hannover, Appelstr 11A, D-30167 Hannover, Germany

Source:

INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY | 2019, Vol. 14, No. 03

Keywords:

Computer vision; Larynx; Vocal folds; Soft tissue; Open-access dataset; Machine learning; Patient-to-patient fine-tuning; SOFT-TISSUE MOTION; CLASSIFICATION; TRACKING;

DOI:

10.1007/s11548-018-01910-0

Chinese Library Classification (CLC) number:

R318 [Biomedical Engineering];

Discipline classification code:

0831;

Abstract:

Purpose: Automated segmentation of anatomical structures in medical image analysis is a prerequisite for autonomous diagnosis as well as various computer- and robot-aided interventions. Recent methods based on deep convolutional neural networks (CNN) have outperformed former heuristic methods. However, those methods were primarily evaluated on rigid, real-world environments. In this study, existing segmentation methods were evaluated for their use on a new dataset of transoral endoscopic exploration.

Methods: Four machine learning-based methods (SegNet, UNet, ENet and ErfNet) were trained with supervision on a novel 7-class dataset of the human larynx. The dataset contains 536 manually segmented images from two patients during laser incisions. The Intersection-over-Union (IoU) evaluation metric was used to measure the accuracy of each method. Data augmentation and network ensembling were employed to increase segmentation accuracy. Stochastic inference was used to show uncertainties of the individual models. Patient-to-patient transfer was investigated using patient-specific fine-tuning.

Results: In this study, a weighted average ensemble network of UNet and ErfNet was best suited for the segmentation of laryngeal soft tissue with a mean IoU of 84.7%. The highest efficiency was achieved by ENet with a mean inference time of 9.22 ms per image. It is shown that 10 additional images from a new patient are sufficient for patient-specific fine-tuning.

Conclusion: CNN-based methods for semantic segmentation are applicable to endoscopic images of laryngeal soft tissue. The segmentation can be used for active constraints or to monitor morphological changes and autonomously detect pathologies. Further improvements could be achieved by using a larger dataset or training the models in a self-supervised manner on additional unlabeled data.
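
The abstract names two building blocks, the mean IoU metric and a weighted average ensemble of two networks, without giving their exact formulation. The following is a minimal NumPy sketch under stated assumptions, not the authors' implementation: the function names, the ensemble weight `weight_a`, the choice to skip classes absent from both maps, and the toy arrays are all hypothetical.

```python
import numpy as np

NUM_CLASSES = 7  # the abstract describes a 7-class dataset


def mean_iou(pred, target, num_classes=NUM_CLASSES):
    """Mean Intersection-over-Union over classes present in pred or target.

    Assumption: classes absent from both label maps are skipped.
    """
    ious = []
    for c in range(num_classes):
        pred_c = pred == c
        target_c = target == c
        union = np.logical_or(pred_c, target_c).sum()
        if union == 0:
            continue  # class absent from both maps
        intersection = np.logical_and(pred_c, target_c).sum()
        ious.append(intersection / union)
    return float(np.mean(ious))


def weighted_ensemble(prob_a, prob_b, weight_a=0.5):
    """Weighted average of two (H, W, C) class-probability maps, then argmax.

    Assumption: weight_a is a free parameter; the abstract does not state
    the weights used for the UNet/ErfNet ensemble.
    """
    combined = weight_a * prob_a + (1.0 - weight_a) * prob_b
    return combined.argmax(axis=-1)


# Toy 2x2 label maps with hypothetical values
target = np.array([[0, 1], [1, 2]])
pred = np.array([[0, 1], [2, 2]])
print(mean_iou(pred, target))
```

In this sketch the ensemble averages per-pixel class probabilities before taking the argmax, so a confident network can outvote an uncertain one at a given pixel.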


Pages: 483-492

Page count: 10

50 items in total

[11] Intelligent weight prediction of cows based on semantic segmentation and back propagation neural network
Xu, Beibei
Mao, Yifan
Wang, Wensheng
Chen, Guipeng
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
[12] Automatic glottis segmentation for laryngeal endoscopic images based on U-Net
Ding, Huijun
Cen, Qian
Si, Xiaoyu
Pan, Zhanpeng
Chen, Xiangdong
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 71
[13] Combined Transfer Learning and Test-Time Augmentation Improves Convolutional Neural Network-Based Semantic Segmentation of Prostate Cancer from Multi-Parametric MR Images
Hoar, David
Lee, Peter Q.
Guida, Alessandro
Patterson, Steven
Bowen, Chris, V
Merrimen, Jennifer
Wang, Cheng
Rendon, Ricardo
Beyea, Steven D.
Clarke, Sharon E.
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 210
[14] Comparative Study of Neural Network-Based Approaches for QRS Segmentation
Kolokolnikov, George
Borde, Anna
Skuratov, Victor
Gaponov, Roman
Rumyantseva, Anastasiya
INTERNATIONAL JOURNAL OF EMBEDDED AND REAL-TIME COMMUNICATION SYSTEMS (IJERTCS), 2020, 11 (04) : 80 - 103
[15] A convolutional neural network model for semantic segmentation of mitotic events in microscopy images
Ozturk, Saban
Akdemir, Bayram
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08) : 3719 - 3728
[16] Classification of endoscopic images based on texture and neural network
Wang, P
Krishnan, SM
Kugean, C
Tjoa, MP
PROCEEDINGS OF THE 23RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: BUILDING NEW BRIDGES AT THE FRONTIERS OF ENGINEERING AND MEDICINE, 2001, 23 : 3691 - 3695
[17] Gated Convolutional Neural Network for Semantic Segmentation in High-Resolution Images
Wang, Hongzhen
Wang, Ying
Zhang, Qian
Xiang, Shiming
Pan, Chunhong
REMOTE SENSING, 2017, 9 (05)
[18] Semantic Segmentation of Remote Sensing Image Based on Neural Network
Wang Ende
Qi Kai
Li Xuepeng
Peng Liangyu
ACTA OPTICA SINICA, 2019, 39 (12)
[19] The application of convolution neural network based cell segmentation during cryopreservation
Mbogba, Momoh Karmah
Haider, Zeeshan
Hossain, S. M. Chapal
Huang, Daobin
Memon, Kashan
Panhwar, Fazil
Lei, Zeling
Zhao, Gang
CRYOBIOLOGY, 2018, 85 : 95 - 104
[20] Deep Convolution Neural Network-Based Crack Feature Extraction, Detection and Quantification
Teng, Shuai
Chen, Gongfa
JOURNAL OF FAILURE ANALYSIS AND PREVENTION, 2022, 22 (03) : 1308 - 1321
