Multi-instance learning based artificial intelligence model to assist vocal fold leukoplakia diagnosis: A multicentre diagnostic study

被引:5
作者
Wang, Mei-Ling [1 ,2 ]
Tie, Cheng-Wei [1 ]
Wang, Jian-Hui [1 ,3 ]
Zhu, Ji-Qing
Chen, Bing -Hong [1 ,2 ]
Li, Ying [1 ,2 ]
Zhang, Sen [4 ]
Liu, Lin [5 ]
Guo, Li [6 ]
Yang, Long [7 ]
Yang, Li-Qun [7 ]
Wei, Jiao [8 ]
Jiang, Feng [9 ]
Zhao, Zhi-Qiang [10 ]
Wang, Gui-Qi [1 ]
Zhang, Wei [1 ,2 ]
Zhang, Quan-Mao [3 ]
Ni, Xiao-Guang [1 ]
机构
[1] Chinese Acad Med Sci, Peking Union Med Coll, Dept Endoscopy, Natl Canc Ctr,Natl Clin Res Ctr Canc,Canc Hosp, 17 Panjiayuan South Lane, Shenzhen 100021, Peoples R China
[2] Chinese Acad Med Sci, Peking Union Med Coll, Shenzhen Hosp, Shenzhen, Peoples R China
[3] Chinese Acad Med Sci, Canc Hosp, Shanxi Med Univ, Dept Endoscopy,Shanxi Prov Canc Hosp,Shanxi Hosp, Taiyuan, Peoples R China
[4] Shanxi Med Univ, Hosp 1, Dept Otolaryngol Head & Neck Surg, Taiyuan, Peoples R China
[5] Dalian Friendship Hosp, Dept Otolaryngol Head & Neck Surg, Dalian, Peoples R China
[6] Henan Univ Sci & Technol, Coll Clin Med, Affiliated Hosp 1, Dept Otolaryngol Head & Neck Surg, Luoyang, Peoples R China
[7] Second Peoples Hosp Baoshan City, Dept Otolaryngol, Baoshan, Peoples R China
[8] Qujing Second Peoples Hosp Yunnan Prov, Dept Otolaryngol, Qujing, Peoples R China
[9] Kunming First Peoples Hosp, Dept Otolaryngol, Kunming, Yunnan, Peoples R China
[10] Baoshan Peoples Hosp, Dept Otolaryngol, Baoshan, Peoples R China
关键词
Vocal fold leukoplakia; Laryngoscopy; Artificial intelligence; Segmentation; Multi -instance learning; CLASSIFICATION;
D O I
10.1016/j.amjoto.2024.104342
中图分类号
R76 [耳鼻咽喉科学];
学科分类号
100213 ;
摘要
Objective: To develop a multi -instance learning (MIL) based artificial intelligence (AI) -assisted diagnosis models by using laryngoscopic images to differentiate benign and malignant vocal fold leukoplakia (VFL). Methods: The AI system was developed, trained and validated on 5362 images of 551 patients from three hospitals. Automated regions of interest (ROI) segmentation algorithm was utilized to construct image -level features. MIL was used to fusion image level results to patient level features, then the extracted features were modeled by seven machine learning algorithms. Finally, we evaluated the image level and patient level results. Additionally, 50 videos of VFL were prospectively gathered to assess the system's real-time diagnostic capabilities. A human -machine comparison database was also constructed to compare the diagnostic performance of otolaryngologists with and without AI assistance. Results: In internal and external validation sets, the maximum area under the curve (AUC) for image level segmentation models was 0.775 (95 % CI 0.740-0.811) and 0.720 (95 % CI 0.684-0.756), respectively. Utilizing a MIL -based fusion strategy, the AUC at the patient level increased to 0.869 (95 % CI 0.798-0.940) and 0.851 (95 % CI 0.756-0.945). For real-time video diagnosis, the maximum AUC at the patient level reached 0.850 (95 % CI, 0.743-0.957). With AI assistance, the AUC improved from 0.720 (95 % CI 0.682-0.755) to 0.808 (95 % CI 0.775-0.839) for senior otolaryngologists and from 0.647 (95 % CI 0.608-0.686) to 0.807 (95 % CI 0.773-0.837) for junior otolaryngologists.
引用
收藏
页数:9
相关论文
共 31 条
[1]   Deep Learning Applied to White Light and Narrow Band Imaging Videolaryngoscopy: Toward Real-Time Laryngeal Cancer Detection [J].
Azam, Muhammad Adeel ;
Sampieri, Claudio ;
Ioppi, Alessandro ;
Africano, Stefano ;
Vallin, Alberto ;
Mocellin, Davide ;
Fragale, Marco ;
Guastini, Luca ;
Moccia, Sara ;
Piazza, Cesare ;
Mattos, Leonardo S. ;
Peretti, Giorgio .
LARYNGOSCOPE, 2022, 132 (09) :1798-1806
[2]  
Barnes L, 2005, Head and neck tumours, P140
[3]   Multiple instance learning: A survey of problem characteristics and applications [J].
Carbonneau, Marc-Andre ;
Cheplygina, Veronika ;
Granger, Eric ;
Gagnon, Ghyslain .
PATTERN RECOGNITION, 2018, 77 :329-353
[4]   Nonsurgical Treatment for Vocal Fold Leukoplakia: An Analysis of 178 Cases [J].
Chen, Min ;
Cheng, Lei ;
Li, Chang-jiang ;
Chen, Jian ;
Shu, Yi-lai ;
Wu, Hai-tao .
BIOMED RESEARCH INTERNATIONAL, 2017, 2017
[5]   Diagnostic Accuracies of Laryngeal Diseases Using a Convolutional Neural Network-Based Image Classification System [J].
Cho, Won Ki ;
Lee, Yeong Ju ;
Joo, Hye Ah ;
Jeong, In Seong ;
Choi, Yeonjoo ;
Nam, Soon Yuhl ;
Kim, Sang Yoon ;
Choi, Seung-Ho .
LARYNGOSCOPE, 2021, 131 (11) :2558-2566
[6]   Cross-Scale Attention Guided Multi-instance Learning for Crohn's Disease Diagnosis with Pathological Images [J].
Deng, Ruining ;
Cui, Can ;
Remedios, Lucas W. ;
Bao, Shunxing ;
Womick, R. Michael ;
Chiron, Sophie ;
Li, Jia ;
Roland, Joseph T. ;
Lau, Ken S. ;
Liu, Qi ;
Wilson, Keith T. ;
Wang, Yaohong ;
Coburn, Lori A. ;
Landman, Bennett A. ;
Huo, Yuankai .
MULTISCALE MULTIMODAL MEDICAL IMAGING, MMMI 2022, 2022, 13594 :24-33
[7]   Deep Convolution Neural Network for Laryngeal Cancer Classification on Contact Endoscopy-Narrow Band Imaging [J].
Esmaeili, Nazila ;
Sharaf, Esam ;
Gomes Ataide, Elmer Jeto ;
Illanes, Alfredo ;
Boese, Axel ;
Davaris, Nikolaos ;
Arens, Christoph ;
Navab, Nassir ;
Friebe, Michael .
SENSORS, 2021, 21 (23)
[8]   Laryngeal Lesion Classification Based on Vascular Patterns in Contact Endoscopy and Narrow Band Imaging: Manual Versus Automatic Approach [J].
Esmaeili, Nazila ;
Illanes, Alfredo ;
Boese, Axel ;
Davaris, Nikolaos ;
Arens, Christoph ;
Navab, Nassir ;
Friebe, Michael .
SENSORS, 2020, 20 (14) :1-12
[9]   A deep convolutional neural network-based method for laryngeal squamous cell carcinoma diagnosis [J].
He, Yurong ;
Cheng, Yingduan ;
Huang, Zhigang ;
Xu, Wen ;
Hu, Rong ;
Cheng, Liyu ;
He, Shizhi ;
Yue, Changli ;
Qin, Gang ;
Wang, Yan ;
Zhong, Qi .
ANNALS OF TRANSLATIONAL MEDICINE, 2021, 9 (24)
[10]   Management of Laryngeal Dysplasia and Early Invasive Cancer [J].
Hrelec, Candace .
CURRENT TREATMENT OPTIONS IN ONCOLOGY, 2021, 22 (10)