Interpretable Computer Vision to Detect and Classify Structural Laryngeal Lesions in Digital Flexible Laryngoscopic Images

被引：16

作者：

Bur, Andres M. ^{[1
,5
]}

Zhang, Tianxiao ^{[2
]}

Chen, Xiangyu ^{[2
]}

Kavookjian, Hannah ^{[1
]}

Kraft, Shannon ^{[1
]}

Karadaghy, Omar ^{[1
]}

Farrokhian, Nathan ^{[1
]}

Mussatto, Caroline ^{[3
]}

Penn, Joseph ^{[3
]}

Wang, Guanghui ^{[4
]}

机构：

[1] Univ Kansas, Dept Otolaryngol Head & Neck Surg, Med Ctr, Kansas City, KS USA

[2] Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS USA

[3] Univ Kansas, Sch Med, Kansas City, KS USA

[4] Toronto Metropolitan Univ, Dept Comp Sci, Toronto, ON, Canada

[5] Univ Kansas, Dept Otolaryngol Head & Neck Surg, Sch Med, 3901 Rainbow Blvd, Kansas City, KS 66160 USA

来源：

OTOLARYNGOLOGY-HEAD AND NECK SURGERY | 2023年 / 169卷 / 06期

基金：

美国国家卫生研究院;

关键词：

artificial intelligence; detection; laryngeal cancer; laryngoscopy; neural networks; COMPETENCE;

D O I：

10.1002/ohn.411

中图分类号：

R76 [耳鼻咽喉科学];

学科分类号：

100213 ;

摘要：

ObjectiveTo localize structural laryngeal lesions within digital flexible laryngoscopic images and to classify them as benign or suspicious for malignancy using state-of-the-art computer vision detection models. Study DesignCross-sectional diagnostic study SettingTertiary care voice clinic MethodsDigital stroboscopic videos, demographic and clinical data were collected from patients evaluated for a structural laryngeal lesion. Laryngoscopic images were extracted from videos and manually labeled with bounding boxes encompassing the lesion. Four detection models were employed to simultaneously localize and classify structural laryngeal lesions in laryngoscopic images. Classification accuracy, intersection over union (IoU) and mean average precision (mAP) were evaluated as measures of classification, localization, and overall performance, respectively. ResultsIn total, 8,172 images from 147 patients were included in the laryngeal image dataset. Classification accuracy was 88.5 for individual laryngeal images and increased to 92.0 when all images belonging to the same sequence (video) were considered. Mean average precision across all four detection models was 50.1 using an IoU threshold of 0.5 to determine successful localization. ConclusionResults of this study showed that deep neural network-based detection models trained using a labeled dataset of digital laryngeal images have the potential to classify structural laryngeal lesions as benign or suspicious for malignancy and to localize them within an image. This approach provides valuable insight into which part of the image was used by the model to determine a diagnosis, allowing clinicians to independently evaluate models' predictions.

引用

页码：1564 / 1572

页数：9

共 20 条

[11] Feature Pyramid Networks for Object Detection [J].

Lin, Tsung-Yi ;

Dollar, Piotr ;

Girshick, Ross ;

He, Kaiming ;

Hariharan, Bharath ;

Belongie, Serge .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944

[12] Universal adversarial perturbations [J].

Moosavi-Dezfooli, Seyed-Mohsen ;

Fawzi, Alhussein ;

Fawzi, Omar ;

Frossard, Pascal .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :86-94

[13] Automatic Recognition of Laryngoscopic Images Using a Deep-Learning Technique [J].

Ren, Jianjun ;

Jing, Xueping ;

Wang, Jing ;

Ren, Xue ;

Xu, Yang ;

Yang, Qiuyun ;

Ma, Lanzhi ;

Sun, Yi ;

Xu, Wei ;

Yang, Ning ;

Zou, Jian ;

Zheng, Yongbo ;

Chen, Min ;

Gan, Weigang ;

Xiang, Ting ;

An, Junnan ;

Liu, Ruiqing ;

Lv, Cao ;

Lin, Ken ;

Zheng, Xianfeng ;

Lou, Fan ;

Rao, Yufang ;

Yang, Hui ;

Liu, Kai ;

Liu, Geoffrey ;

Lu, Tao ;

Zheng, Xiujuan ;

Zhao, Yu .

LARYNGOSCOPE, 2020, 130 (11) :E686-E693

[14] ImageNet Large Scale Visual Recognition Challenge [J].

Russakovsky, Olga ;

Deng, Jia ;

Su, Hao ;

Krause, Jonathan ;

Satheesh, Sanjeev ;

Ma, Sean ;

Huang, Zhiheng ;

Karpathy, Andrej ;

Khosla, Aditya ;

Bernstein, Michael ;

Berg, Alexander C. ;

Fei-Fei, Li .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252

[15] Harnessing the Power of Artificial Intelligence in Otolaryngology and the Communication Sciences [J].

Wilson, Blake S. ;

Tucci, Debara L. ;

Moses, David A. ;

Chang, Edward F. ;

Young, Nancy M. ;

Zeng, Fan-Gang ;

Lesica, Nicholas A. ;

Bur, Andres M. ;

Kavookjian, Hannah ;

Mussatto, Caroline ;

Penn, Joseph ;

Goodwin, Sara ;

Kraft, Shannon ;

Wang, Guanghui ;

Cohen, Jonathan M. ;

Ginsburg, Geoffrey S. ;

Dawson, Geraldine ;

Francis, Howard W. .

JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2022, 23 (03) :319-349

[16] A "Medical Mission" at Home: The Needs of Rural America in Terms of Otolaryngology Care [J].

Winters, Ryan ;

Pou, Anna ;

Friedlander, Paul .

JOURNAL OF RURAL HEALTH, 2011, 27 (03) :297-301

[17] Applications of Artificial Intelligence to Office Laryngoscopy: A Scoping Review [J].

Yao, Peter ;

Usman, Moon ;

Chen, Yu H. ;

German, Alexander ;

Andreadis, Katerina ;

Mages, Keith ;

Rameau, Anais .

LARYNGOSCOPE, 2022, 132 (10) :1993-2016

[18] VarifocalNet: An IoU-aware Dense Object Detector [J].

Zhang, Haoyang ;

Wang, Ying ;

Dayoub, Feras ;

Sunderhauf, Niko .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8510-8519

[19] Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection [J].

Zhang, Shifeng ;

Chi, Cheng ;

Yao, Yongqiang ;

Lei, Zhen ;

Li, Stan Z. .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9756-9765

[20] Artificial Intelligence in Laryngeal Endoscopy: Systematic Review and Meta-Analysis [J].

Zurek, Michal ;

Jasak, Kamil ;

Niemczyk, Kazimierz ;

Rzepakowska, Anna .

JOURNAL OF CLINICAL MEDICINE, 2022, 11 (10)

← 1 2 →