Segmentation of endoscopy images of anterior nasal cavity using deep learning

被引:1
作者
Phoommanee, Nonpawith [1 ]
Andrews, Peter J. [2 ,3 ]
Leung, Terence S. [1 ]
机构
[1] UCL, Dept Med Phys & Biomed Engn, London WC1E 6BT, England
[2] Royal Natl Throat Nose & Ear Hosp, Dept Rhinol & Facial Plast Surg, London WC1E 6DG, England
[3] UCL, UCL Ear Inst, London WC1X 8EE, England
来源
COMPUTER-AIDED DIAGNOSIS, MEDICAL IMAGING 2024 | 2024年 / 12927卷
关键词
nasal obstruction; segmentation; deep learning; transfer learning; low-light image enhancement;
D O I
10.1117/12.2691427
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nasal obstruction (NO), which affects one-third of the adult population, is characterized by a blockage in the nasal cavity. Rhinologists commonly employ nasal endoscopy (NE) in the differential diagnosis of NO, along with a focused history and other examinations such as skin prick tests and CT scans. This study aims to establish NE as a reliable standalone diagnostic tool, eliminating the necessity for CT scans and skin prick tests in the diagnosis of NO. However, currently, there is a lack of objective methods to quantify the severity of NO. To address this problem, we used deep learning to identify the anatomical structures of the anterior nasal cavity, which will then be graded by an objective grading system. In this paper, we evaluated the performance of various deep learning methods (DeepLabv3+, MaskFormer, and Mask2Former) with different pre-trained backbones (ResNet-101 - CNN-based, and Swin-Tiny - transformer-based), for semantic segmentation of the anterior nasal cavity. Sixty-two participants were examined with NE before and after using a nasal decongestant. For model training and validation, 608 images from 46 participants were utilized, and 171 images from 16 participants were reserved for testing. The fine-tuned Mask2Former with low-light image enhancement achieved a mean intersection-over-union of 81.7% and 61.2% on the validation and testing sets, respectively. These findings represent the first successful semantic segmentation of key anatomical structures within the anterior nasal cavity. These segmented structures will serve as the basis for classifying the severity of NO and diagnosing NO conditions, enabling AI-based consultations in primary care settings such as general practices and remote locations, where access to ENT expertise may be limited.
引用
收藏
页数:5
相关论文
共 13 条
[1]   CaMap: Camera-based Map Manipulation on Mobile Devices [J].
Chen, Liang ;
Chen, Dongyi .
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
[2]  
Cheng B, 2021, ADV NEUR IN, V34
[3]   Masked-attention Mask Transformer for Universal Image Segmentation [J].
Cheng, Bowen ;
Misra, Ishan ;
Schwing, Alexander G. ;
Kirillov, Alexander ;
Girdhar, Rohit .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :1280-1289
[4]  
Everingham M, 2012, The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Results
[5]  
Fokkens WJ, 2020, European position paper on rhinosinusitis and nasal polyps 2020
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]   Anatomy and Physiology of Nasal Obstruction [J].
Hsu, David W. ;
Suh, Jeffrey D. .
OTOLARYNGOLOGIC CLINICS OF NORTH AMERICA, 2018, 51 (05) :853-+
[8]   Diagnosing nasal obstruction and its common causes using the nasal acoustic device: A pilot study [J].
Li, Chia-Hung ;
Kaura, Anika ;
Tan, Calvin ;
Whitcroft, Katherine L. ;
Leung, Terence S. ;
Andrews, Peter .
LARYNGOSCOPE INVESTIGATIVE OTOLARYNGOLOGY, 2020, 5 (05) :796-806
[9]   Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J].
Liu, Ze ;
Lin, Yutong ;
Cao, Yue ;
Hu, Han ;
Wei, Yixuan ;
Zhang, Zheng ;
Lin, Stephen ;
Guo, Baining .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9992-10002
[10]   The internal nasal valve: a validated grading system and operative guide [J].
Patel, B. ;
Virk, J. S. ;
Randhawa, P. S. ;
Andrews, P. J. .
EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2018, 275 (11) :2739-2744