High-Speed Videoendoscopy Enhances the Objective Assessment of Glottic Organic Lesions: A Case-Control Study with Multivariable Data-Mining Model Development

被引:3
作者
Malinowski, Jakub [1 ]
Pietruszewska, Wioletta [1 ]
Stawiski, Konrad [2 ,3 ]
Kowalczyk, Magdalena [1 ]
Baranska, Magda [1 ]
Rycerz, Aleksander [3 ]
Niebudek-Bogusz, Ewa [1 ]
机构
[1] Med Univ Lodz, Dept Otolaryngol Head & Neck Oncol, PL-90419 Lodz, Poland
[2] Harvard Med Sch, Dana Farber Canc Inst, Dept Radiat Oncol, Boston, MA 02115 USA
[3] Med Univ Lodz, Dept Biostat & Translat Med, PL-90419 Lodz, Poland
关键词
glottis organic pathology; glottic cancer; high-speed videoendoscopy; kymography; machine learning; VOCAL FOLD VIBRATION; VOICE; QUANTIFICATION; PATHOLOGIES; STROBOSCOPY;
D O I
10.3390/cancers15143716
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Simple Summary The standard protocol for distinguishing between benign and malignant lesions remains clinical judgment and histopathological confirmation by an experienced otolaryngologist. An additional tool is high-speed videoendoscopy (HSV), an accurate method for an objective assessment of vocal fold oscillations. The aim of the study was to utilize a quantitative assessment of the vibratory characteristics of vocal folds in diagnosing benign and malignant lesions of the glottis using HSV. The machine learning model identifying malignancy among organic lesions reached an AUC equal to 0.85 and presented with 80.6% accuracy, 100% sensitivity, and 71.1% specificity on the training set. Important predictive factors were frequency perturbation measures. The results suggested that advanced machine learning models based on HSV analysis could potentially indicate a heightened risk of cancerous mass. Therefore, this technology could, in future, aid in early cancer detection; however, further investigation and validation is needed. The aim of the study was to utilize a quantitative assessment of the vibratory characteristics of vocal folds in diagnosing benign and malignant lesions of the glottis using high-speed videolaryngoscopy (HSV). Methods: Case-control study including 100 patients with unilateral vocal fold lesions in comparison to 38 normophonic subjects. Quantitative assessment with the determination of vocal fold oscillation parameters was performed based on HSV kymography. Machine-learning predictive models were developed and validated. Results: All calculated parameters differed significantly between healthy subjects and patients with organic lesions. The first predictive model distinguishing any organic lesion patients from healthy subjects reached an area under the curve (AUC) equal to 0.983 and presented with 89.3% accuracy, 97.0% sensitivity, and 71.4% specificity on the testing set. The second model identifying malignancy among organic lesions reached an AUC equal to 0.85 and presented with 80.6% accuracy, 100% sensitivity, and 71.1% specificity on the training set. Important predictive factors for the models were frequency perturbation measures. Conclusions: The standard protocol for distinguishing between benign and malignant lesions continues to be clinical evaluation by an experienced ENT specialist and confirmed by histopathological examination. Our findings did suggest that advanced machine learning models, which consider the complex interactions present in HSV data, could potentially indicate a heightened risk of malignancy. Therefore, this technology could prove pivotal in aiding in early cancer detection, thereby emphasizing the need for further investigation and validation.
引用
收藏
页数:15
相关论文
共 46 条
  • [31] Comparison of Videostroboscopy to Stroboscopy Derived From High-Speed Videoendoscopy for Evaluating Patients With Vocal Fold Mass Lesions
    Powell, Maria E.
    Deliyski, Dimitar D.
    Hillman, Robert E.
    Zeitels, Steven M.
    Burns, James A.
    Mehta, Daryush D.
    [J]. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2016, 25 (04) : 576 - 589
  • [32] Interpretation of machine learning models using shapley values: application to compound potency and multi-target activity predictions
    Rodriguez-Perez, Raquel
    Bajorath, Juergen
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2020, 34 (10) : 1013 - 1026
  • [33] Stroboscopy as a research instrument: Development of a perceptual evaluation tool
    Rosen, CA
    [J]. LARYNGOSCOPE, 2005, 115 (03) : 423 - 428
  • [34] High speed glottographic diagnostics in laryngology
    Schade, G
    Müller, F
    [J]. HNO, 2005, 53 (12) : 1085 - +
  • [35] Machine learning based identification of relevant parameters for functional voice disorders derived from endoscopic high-speed recordings
    Schlegel, Patrick
    Kniesburges, Stefan
    Duerr, Stephan
    Schuetzenberger, Anne
    Doellinger, Michael
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [36] Laryngeal High-Speed Videoendoscopy: Sensitivity of Objective Parameters towards Recording Frame Rate
    Schuetzenberger, Anne
    Kunduk, Melda
    Doellinger, Michael
    Alexiou, Christoph
    Dubrovskiy, Denis
    Semmler, Marion
    Seger, Anja
    Bohr, Christopher
    [J]. BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [37] Tabular data: Deep learning is not all you need
    Shwartz-Ziv, Ravid
    Armon, Amitai
    [J]. INFORMATION FUSION, 2022, 81 : 84 - 90
  • [38] Stawiski K, 2022, bioRxiv, DOI [10.1101/2022.06.01.494299, 10.1101/2022.06.01.494299, DOI 10.1101/2022.06.01.494299]
  • [39] Svec J.G., 2011, Handbook of Voice Assessment, P129
  • [40] High-speed Videolaryngoscopy: Quantitative Parameters of Glottal Area Waveforms and High-speed Kymography in Healthy Individuals
    Tsutsumi, Monike
    Isotani, Seiji
    Pimenta, Regina Aparecida
    Dajer, Maria Eugenia
    Hachiya, Adriana
    Tsuji, Domingos Hiroshi
    Tayama, Niro
    Yokonishi, Hisayuki
    Imagawa, Hiroshi
    Yamauchi, Akihito
    Takano, Shingo
    Sakakibara, Ken-Ichi
    Montagnoli, Arlindo Neto
    [J]. JOURNAL OF VOICE, 2017, 31 (03) : 282 - 290