Videomics of the Upper Aero-Digestive Tract Cancer: Deep Learning Applied to White Light and Narrow Band Imaging for Automatic Segmentation of Endoscopic Images

被引:25
作者
Azam, Muhammad Adeel [1 ]
Sampieri, Claudio [2 ,3 ]
Ioppi, Alessandro [2 ,3 ]
Benzi, Pietro [2 ,3 ]
Giordano, Giorgio Gregory [2 ,3 ]
De Vecchi, Marta [2 ,3 ]
Campagnari, Valentina [2 ,3 ]
Li, Shunlei [1 ]
Guastini, Luca [2 ,3 ]
Paderno, Alberto [4 ,5 ]
Moccia, Sara [6 ,7 ]
Piazza, Cesare [4 ,5 ]
Mattos, Leonardo S. [1 ]
Peretti, Giorgio [2 ,3 ]
机构
[1] Ist Italiano Tecnol, Dept Adv Robot, Genoa, Italy
[2] IRCCS Osped Policlin San Martino, Unit Otorhinolaryngol Head & Neck Surg, Genoa, Italy
[3] Univ Genoa, Dept Surg Sci & Integrated Diagnost DISC, Genoa, Italy
[4] Unit Otorhinolaryngol Head & Neck Surg, ASST Spedali Civili Brescia, Brescia, Italy
[5] Univ Brescia, Dept Med & Surg Specialties, Radiol Sci & Publ Hlth, Brescia, Italy
[6] Scuola Super Sant Anna, BioRobot Inst, Pisa, Italy
[7] Scuola Super Sant Anna, Dept Excellence Robot & AI, Pisa, Italy
来源
FRONTIERS IN ONCOLOGY | 2022年 / 12卷
关键词
larynx cancer; oral cancer; oropharynx cancer; machine learning; endoscopy; laryngoscopy; computer vision; otorhinolaryngology; HIGH-DEFINITION TELEVISION; SQUAMOUS-CELL CARCINOMA; RESECTION MARGINS; ORAL-CAVITY; LESIONS; NASOPHARYNX;
D O I
10.3389/fonc.2022.900451
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
IntroductionNarrow Band Imaging (NBI) is an endoscopic visualization technique useful for upper aero-digestive tract (UADT) cancer detection and margins evaluation. However, NBI analysis is strongly operator-dependent and requires high expertise, thus limiting its wider implementation. Recently, artificial intelligence (AI) has demonstrated potential for applications in UADT videoendoscopy. Among AI methods, deep learning algorithms, and especially convolutional neural networks (CNNs), are particularly suitable for delineating cancers on videoendoscopy. This study is aimed to develop a CNN for automatic semantic segmentation of UADT cancer on endoscopic images. Materials and MethodsA dataset of white light and NBI videoframes of laryngeal squamous cell carcinoma (LSCC) was collected and manually annotated. A novel DL segmentation model (SegMENT) was designed. SegMENT relies on DeepLabV3+ CNN architecture, modified using Xception as a backbone and incorporating ensemble features from other CNNs. The performance of SegMENT was compared to state-of-the-art CNNs (UNet, ResUNet, and DeepLabv3). SegMENT was then validated on two external datasets of NBI images of oropharyngeal (OPSCC) and oral cavity SCC (OSCC) obtained from a previously published study. The impact of in-domain transfer learning through an ensemble technique was evaluated on the external datasets. Results219 LSCC patients were retrospectively included in the study. A total of 683 videoframes composed the LSCC dataset, while the external validation cohorts of OPSCC and OCSCC contained 116 and 102 images. On the LSCC dataset, SegMENT outperformed the other DL models, obtaining the following median values: 0.68 intersection over union (IoU), 0.81 dice similarity coefficient (DSC), 0.95 recall, 0.78 precision, 0.97 accuracy. For the OCSCC and OPSCC datasets, results were superior compared to previously published data: the median performance metrics were, respectively, improved as follows: DSC=10.3% and 11.9%, recall=15.0% and 5.1%, precision=17.0% and 14.7%, accuracy=4.1% and 10.3%. ConclusionSegMENT achieved promising performances, showing that automatic tumor segmentation in endoscopic images is feasible even within the highly heterogeneous and complex UADT environment. SegMENT outperformed the previously published results on the external validation cohorts. The model demonstrated potential for improved detection of early tumors, more precise biopsies, and better selection of resection margins.
引用
收藏
页数:14
相关论文
共 59 条
  • [1] Abadi M, 2016, ACM SIGPLAN NOTICES, V51, P1, DOI [10.1145/2951913.2976746, 10.1145/3022670.2976746]
  • [2] Abraham N, 2019, I S BIOMED IMAGING, P683, DOI 10.1109/ISBI.2019.8759329
  • [3] Towards a Better Understanding of Transfer Learning for Medical Imaging: A Case Study
    Alzubaidi, Laith
    Fadhel, Mohammed A.
    Al-Shamma, Omran
    Zhang, Jinglan
    Santamaria, J.
    Duan, Ye
    Oleiwi, Sameer R.
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (13):
  • [4] [Anonymous], 2017, P BILD VER MED, DOI DOI 10.1007/978-3-662-54345-0_23
  • [5] Proposal for a descriptive guideline of vascular changes in lesions of the vocal folds by the committee on endoscopic laryngeal imaging of the European Laryngological Society
    Arens, Christoph
    Piazza, Cesare
    Andrea, Mario
    Dikkers, Frederik G.
    Gi, Robin E. A. Tjon Pian
    Voigt-Zimmermann, Susanne
    Peretti, Giorgio
    [J]. EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2016, 273 (05) : 1207 - 1214
  • [6] Deep Learning Applied to White Light and Narrow Band Imaging Videolaryngoscopy: Toward Real-Time Laryngeal Cancer Detection
    Azam, Muhammad Adeel
    Sampieri, Claudio
    Ioppi, Alessandro
    Africano, Stefano
    Vallin, Alberto
    Mocellin, Davide
    Fragale, Marco
    Guastini, Luca
    Moccia, Sara
    Piazza, Cesare
    Mattos, Leonardo S.
    Peretti, Giorgio
    [J]. LARYNGOSCOPE, 2022, 132 (09) : 1798 - 1806
  • [7] Effectiveness of narrow band imaging in the detection of premalignant and malignant lesions of the larynx: Validation of a new endoscopic clinical classification
    Bertino, Giulia
    Cacciola, Salvatore
    Fernandes, Wladir Bastos, Jr.
    Fernandes, Carolina Muniz
    Occhini, Antonio
    Tinelli, Carmine
    Benazzo, Marco
    [J]. HEAD AND NECK-JOURNAL FOR THE SCIENCES AND SPECIALTIES OF THE HEAD AND NECK, 2015, 37 (02): : 215 - 222
  • [8] Benchmark Analysis of Representative Deep Neural Network Architectures
    Bianco, Simone
    Cadene, Remi
    Celona, Luigi
    Napoletano, Paolo
    [J]. IEEE ACCESS, 2018, 6 : 64270 - 64277
  • [9] Application of bioendoscopy filters in endoscopic assessment of sinonasal Schneiderian papillomas
    Carobbio, Andrea Luigi Camillo
    Vallin, Alberto
    Ioppi, Alessandro
    Missale, Francesco
    Ascoli, Alessandro
    Mocellin, Davide
    Bagnasco, Diego
    Mora, Renzo
    Peretti, Giorgio
    Canevari, Frank Rikki Mauritz
    [J]. INTERNATIONAL FORUM OF ALLERGY & RHINOLOGY, 2021, 11 (06) : 1025 - 1028
  • [10] Enhanced contact endoscopy for the assessment of the neoangiogenetic changes in precancerous and cancerous lesions of the oral cavity and oropharynx
    Carta, Filippo
    Sionis, Sara
    Cocco, Daniela
    Gerosa, Clara
    Ferreli, Caterina
    Puxeddu, Roberto
    [J]. EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2016, 273 (07) : 1895 - 1903