Segmentation of endoscopy images of anterior nasal cavity using deep learning

被引:1
作者
Phoommanee, Nonpawith [1 ]
Andrews, Peter J. [2 ,3 ]
Leung, Terence S. [1 ]
机构
[1] UCL, Dept Med Phys & Biomed Engn, London WC1E 6BT, England
[2] Royal Natl Throat Nose & Ear Hosp, Dept Rhinol & Facial Plast Surg, London WC1E 6DG, England
[3] UCL, UCL Ear Inst, London WC1X 8EE, England
来源
COMPUTER-AIDED DIAGNOSIS, MEDICAL IMAGING 2024 | 2024年 / 12927卷
关键词
nasal obstruction; segmentation; deep learning; transfer learning; low-light image enhancement;
D O I
10.1117/12.2691427
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nasal obstruction (NO), which affects one-third of the adult population, is characterized by a blockage in the nasal cavity. Rhinologists commonly employ nasal endoscopy (NE) in the differential diagnosis of NO, along with a focused history and other examinations such as skin prick tests and CT scans. This study aims to establish NE as a reliable standalone diagnostic tool, eliminating the necessity for CT scans and skin prick tests in the diagnosis of NO. However, currently, there is a lack of objective methods to quantify the severity of NO. To address this problem, we used deep learning to identify the anatomical structures of the anterior nasal cavity, which will then be graded by an objective grading system. In this paper, we evaluated the performance of various deep learning methods (DeepLabv3+, MaskFormer, and Mask2Former) with different pre-trained backbones (ResNet-101 - CNN-based, and Swin-Tiny - transformer-based), for semantic segmentation of the anterior nasal cavity. Sixty-two participants were examined with NE before and after using a nasal decongestant. For model training and validation, 608 images from 46 participants were utilized, and 171 images from 16 participants were reserved for testing. The fine-tuned Mask2Former with low-light image enhancement achieved a mean intersection-over-union of 81.7% and 61.2% on the validation and testing sets, respectively. These findings represent the first successful semantic segmentation of key anatomical structures within the anterior nasal cavity. These segmented structures will serve as the basis for classifying the severity of NO and diagnosing NO conditions, enabling AI-based consultations in primary care settings such as general practices and remote locations, where access to ENT expertise may be limited.
引用
收藏
页数:5
相关论文
共 13 条
[11]   Position Paper on Nasal Obstruction: Evaluation and Treatment [J].
Valero, A. ;
Navarro, A. M. ;
del Cuvillo, A. ;
Alobid, I ;
Benito, J. R. ;
Colas, C. ;
de los Santos, G. ;
Liesa R, Fernandez ;
Garcia-Lliberos, A. ;
Gonzalez-Perez, R. ;
Izquierdo-Dominguez, A. ;
Jurado-Ramos, A. ;
Lluch-Bernal, M. M. ;
Montserrat Gili, J. R. ;
Mullol, J. ;
Casas A, Puiggros ;
Sanchez-Hernandez, M. C. ;
Vega, F. ;
Villacampa, J. M. ;
Armengot-Carceller, M. ;
Dordal, M. T. .
JOURNAL OF INVESTIGATIONAL ALLERGOLOGY AND CLINICAL IMMUNOLOGY, 2018, 28 (02) :67-90
[12]   Scene Parsing through ADE20K Dataset [J].
Zhou, Bolei ;
Zhao, Hang ;
Puig, Xavier ;
Fidler, Sanja ;
Barriuso, Adela ;
Torralba, Antonio .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5122-5130
[13]   Reliability assessment of the endoscopic examination in patients with allergic rhinitis [J].
Ziade, Georges K. ;
Karami, Reem A. ;
Fakhri, Ghina B. ;
Alam, Elie S. ;
Hamdan, Abdul Latif ;
Mourad, Marc M. ;
Hadi, Usama M. .
ALLERGY & RHINOLOGY, 2016, 7 (03) :E135-E138