Diagnostic Accuracies of Laryngeal Diseases Using a Convolutional Neural Network-Based Image Classification System

Cited by: 43
Authors
Cho, Won Ki [1]
Lee, Yeong Ju [1]
Joo, Hye Ah [1]
Jeong, In Seong [1]
Choi, Yeonjoo [1]
Nam, Soon Yuhl [1]
Kim, Sang Yoon [1]
Choi, Seung-Ho [1]
Affiliations
[1] Univ Ulsan, Dept Otorhinolaryngol Head & Neck Surg, Asan Med Ctr, Coll Med, 88 Olymp Ro 43 Gil, Seoul 05505, South Korea
Keywords
Laryngoscopic images; laryngeal disease; deep learning; neural networks; computer diagnosis; computer-aided diagnosis
DOI
10.1002/lary.29595
Chinese Library Classification (CLC) number
R-3 [Medical research methods]; R3 [Basic medicine]
Subject classification code
1001
Abstract
Objectives/Hypothesis: The diagnosis of laryngeal disease from laryngoscopic images may vary between observers according to clinical experience. This study therefore aimed to perform computer-assisted diagnosis of common laryngeal diseases using deep learning-based disease classification models.
Study Design: Experimental study with retrospective data.
Methods: A total of 4106 images (cysts, nodules, polyps, leukoplakia, papillomas, Reinke's edema, granulomas, palsies, and normal cases) were analyzed. The images were divided into nine folds with each disease distributed equally; stratified eightfold cross-validation was performed on eight folds for training and validation, and the remaining fold was used as the test dataset. The trained model was applied to the test set, and model performance was assessed by precision (positive predictive value), recall (sensitivity), accuracy, F1 score, precision-recall (PR) curve, and area under the PR curve (PR-AUC). Outcomes were compared with visual assessments by four trainees.
Results: The trained deep neural networks (DNNs) outperformed the trainees' visual assessments in discriminating cysts, granulomas, nodules, normal cases, palsies, papillomas, and polyps according to the PR-AUC and F1 score. The lowest F1 score and PR-AUC of the DNNs were obtained for Reinke's edema (0.720, 0.800) and nodules (0.730, 0.780), but these were comparable to the mean F1 score of the two best-performing trainees (0.765 and 0.675, respectively). In discriminating papillomas, the F1 score was much higher for DNNs (0.870) than for trainees (0.685). Overall, DNNs outperformed all trainees (micro-average PR-AUC = 0.95; macro-average PR-AUC = 0.91).
Conclusions: DNN technology could be applied to laryngoscopy to supplement the clinical assessment of examiners by providing additional diagnostic clues and serving as a diagnostic reference.
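The data split and the per-class metrics described in the Methods can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`stratified_folds`, `cv_splits`, `per_class_scores`, `macro_f1`) and the toy labels are hypothetical, the sketch uses only the Python standard library, and it does not cover the PR curve or PR-AUC.

```python
import random
from collections import defaultdict


def stratified_folds(labels, n_folds=9, seed=0):
    """Spread the sample indices of every class evenly over n_folds folds."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(n_folds)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for position, idx in enumerate(indices):
            folds[position % n_folds].append(idx)
    return folds


def cv_splits(folds):
    """Hold out the last fold as the test set; rotate validation over the rest."""
    *cv_folds, test = folds
    for k, validation in enumerate(cv_folds):
        training = [i for j, fold in enumerate(cv_folds) if j != k for i in fold]
        yield training, validation, test


def per_class_scores(y_true, y_pred):
    """Return {class: (precision, recall, f1)} from one-vs-rest counts."""
    pairs = list(zip(y_true, y_pred))
    scores = {}
    for cls in sorted(set(y_true) | set(y_pred)):
        tp = sum(t == cls and p == cls for t, p in pairs)
        fp = sum(t != cls and p == cls for t, p in pairs)
        fn = sum(t == cls and p != cls for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        total = precision + recall
        f1 = 2 * precision * recall / total if total else 0.0
        scores[cls] = (precision, recall, f1)
    return scores


def macro_f1(scores):
    """Unweighted mean of the per-class F1 scores."""
    return sum(f1 for _, _, f1 in scores.values()) / len(scores)


# Toy data (assumed for illustration): 18 "cyst" and 9 "nodule" images.
toy_labels = ["cyst"] * 18 + ["nodule"] * 9
toy_folds = stratified_folds(toy_labels)
toy_splits = list(cv_splits(toy_folds))  # eight train/validation rotations
```

With nine folds, `cv_splits` yields eight training/validation rotations and always returns the same held-out test fold, which mirrors the split described in the Methods. Macro-averaging weights every disease class equally, whereas micro-averaging pools all predictions, so the two can differ when class sizes are unequal.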
Pages: 2558-2566
Number of pages: 9