Videomics of the Upper Aero-Digestive Tract Cancer: Deep Learning Applied to White Light and Narrow Band Imaging for Automatic Segmentation of Endoscopic Images

被引:25
作者
Azam, Muhammad Adeel [1 ]
Sampieri, Claudio [2 ,3 ]
Ioppi, Alessandro [2 ,3 ]
Benzi, Pietro [2 ,3 ]
Giordano, Giorgio Gregory [2 ,3 ]
De Vecchi, Marta [2 ,3 ]
Campagnari, Valentina [2 ,3 ]
Li, Shunlei [1 ]
Guastini, Luca [2 ,3 ]
Paderno, Alberto [4 ,5 ]
Moccia, Sara [6 ,7 ]
Piazza, Cesare [4 ,5 ]
Mattos, Leonardo S. [1 ]
Peretti, Giorgio [2 ,3 ]
机构
[1] Ist Italiano Tecnol, Dept Adv Robot, Genoa, Italy
[2] IRCCS Osped Policlin San Martino, Unit Otorhinolaryngol Head & Neck Surg, Genoa, Italy
[3] Univ Genoa, Dept Surg Sci & Integrated Diagnost DISC, Genoa, Italy
[4] Unit Otorhinolaryngol Head & Neck Surg, ASST Spedali Civili Brescia, Brescia, Italy
[5] Univ Brescia, Dept Med & Surg Specialties, Radiol Sci & Publ Hlth, Brescia, Italy
[6] Scuola Super Sant Anna, BioRobot Inst, Pisa, Italy
[7] Scuola Super Sant Anna, Dept Excellence Robot & AI, Pisa, Italy
来源
FRONTIERS IN ONCOLOGY | 2022年 / 12卷
关键词
larynx cancer; oral cancer; oropharynx cancer; machine learning; endoscopy; laryngoscopy; computer vision; otorhinolaryngology; HIGH-DEFINITION TELEVISION; SQUAMOUS-CELL CARCINOMA; RESECTION MARGINS; ORAL-CAVITY; LESIONS; NASOPHARYNX;
D O I
10.3389/fonc.2022.900451
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
IntroductionNarrow Band Imaging (NBI) is an endoscopic visualization technique useful for upper aero-digestive tract (UADT) cancer detection and margins evaluation. However, NBI analysis is strongly operator-dependent and requires high expertise, thus limiting its wider implementation. Recently, artificial intelligence (AI) has demonstrated potential for applications in UADT videoendoscopy. Among AI methods, deep learning algorithms, and especially convolutional neural networks (CNNs), are particularly suitable for delineating cancers on videoendoscopy. This study is aimed to develop a CNN for automatic semantic segmentation of UADT cancer on endoscopic images. Materials and MethodsA dataset of white light and NBI videoframes of laryngeal squamous cell carcinoma (LSCC) was collected and manually annotated. A novel DL segmentation model (SegMENT) was designed. SegMENT relies on DeepLabV3+ CNN architecture, modified using Xception as a backbone and incorporating ensemble features from other CNNs. The performance of SegMENT was compared to state-of-the-art CNNs (UNet, ResUNet, and DeepLabv3). SegMENT was then validated on two external datasets of NBI images of oropharyngeal (OPSCC) and oral cavity SCC (OSCC) obtained from a previously published study. The impact of in-domain transfer learning through an ensemble technique was evaluated on the external datasets. Results219 LSCC patients were retrospectively included in the study. A total of 683 videoframes composed the LSCC dataset, while the external validation cohorts of OPSCC and OCSCC contained 116 and 102 images. On the LSCC dataset, SegMENT outperformed the other DL models, obtaining the following median values: 0.68 intersection over union (IoU), 0.81 dice similarity coefficient (DSC), 0.95 recall, 0.78 precision, 0.97 accuracy. For the OCSCC and OPSCC datasets, results were superior compared to previously published data: the median performance metrics were, respectively, improved as follows: DSC=10.3% and 11.9%, recall=15.0% and 5.1%, precision=17.0% and 14.7%, accuracy=4.1% and 10.3%. ConclusionSegMENT achieved promising performances, showing that automatic tumor segmentation in endoscopic images is feasible even within the highly heterogeneous and complex UADT environment. SegMENT outperformed the previously published results on the external validation cohorts. The model demonstrated potential for improved detection of early tumors, more precise biopsies, and better selection of resection margins.
引用
收藏
页数:14
相关论文
共 59 条
  • [21] Impact of Close and Positive Margins in Transoral Laser Microsurgery for Tis-T2 Glottic Cancer
    Fiz, Ivana
    Mazzola, Francesco
    Fiz, Francesco
    Marchi, Filippo
    Filauro, Marta
    Paderno, Alberto
    Parrinello, Giampiero
    Piazza, Cesare
    Peretti, Giorgio
    [J]. FRONTIERS IN ONCOLOGY, 2017, 7
  • [22] Role of Narrow Band Imaging Technology in the Diagnosis and Follow up of Laryngeal Lesions: Assessment of Diagnostic Accuracy and Reliability in a Large Patient Cohort
    Galli, Jacopo
    Settimi, Stefano
    Mele, Dario Antonio
    Salvati, Antonio
    Schiavi, Enrico
    Parrilla, Claudio
    Paludetti, Gaetano
    [J]. JOURNAL OF CLINICAL MEDICINE, 2021, 10 (06) : 1 - 10
  • [23] Intraoperative Narrow Band Imaging Better Delineates Superficial Resection Margins During Transoral Laser Microsurgery for Early Glottic Cancer
    Garofolo, Sabrina
    Piazza, Cesare
    Del Bon, Francesca
    Mangili, Stefano
    Guastini, Luca
    Mora, Francesco
    Nicolai, Piero
    Peretti, Giorgio
    [J]. ANNALS OF OTOLOGY RHINOLOGY AND LARYNGOLOGY, 2015, 124 (04) : 294 - 298
  • [24] Using deep learning to identify the recurrent laryngeal nerve during thyroidectomy
    Gong, Julia
    Holsinger, F. Christopher
    Noel, Julia E.
    Mitani, Sohei
    Jopling, Jeff
    Bedi, Nikita
    Koh, Yoon Woo
    Orloff, Lisa A.
    Cernea, Claudio R.
    Yeung, Serena
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [25] Artificial intelligence system for detecting superficial laryngopharyngeal cancer with high efficiency of deep learning
    Inaba, Atsushi
    Hori, Keisuke
    Yoda, Yusuke
    Ikematsu, Hiroaki
    Takano, Hiroaki
    Matsuzaki, Hiroki
    Watanabe, Yoshiki
    Takeshita, Nobuyoshi
    Tomioka, Toshifumi
    Ishii, Genichiro
    Fujii, Satoshi
    Hayashi, Ryuichi
    Yano, Tomonori
    [J]. HEAD AND NECK-JOURNAL FOR THE SCIENCES AND SPECIALTIES OF THE HEAD AND NECK, 2020, 42 (09): : 2581 - 2592
  • [26] Real-Time Polyp Detection, Localization and Segmentation in Colonoscopy Using Deep Learning
    Jha, Debesh
    Ali, Sharib
    Tomar, Nikhil Kumar
    Johansen, Havard D.
    Johansen, Dag
    Rittscher, Jens
    Riegler, Michael A.
    Halvorsen, Pal
    [J]. IEEE ACCESS, 2021, 9 : 40496 - 40510
  • [27] A multi-scale recurrent fully convolution neural network for laryngeal leukoplakia segmentation
    Ji, Bin
    Ren, Jianjun
    Zheng, Xiujuan
    Tan, Cong
    Ji, Rong
    Zhao, Yu
    Liu, Kai
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 59
  • [28] Narrow-band imaging for screening of oral premalignant or cancerous lesions: A systematic review and meta-analysis
    Kim, Do Hyun
    Kim, Sung Won
    Lee, Jaeyoon
    Hwang, Se Hwan
    [J]. CLINICAL OTOLARYNGOLOGY, 2021, 46 (03) : 501 - 507
  • [29] Artificial intelligence and computer-aided diagnosis for colonoscopy: where do we stand now?
    Kudo, Shin-Ei
    Mori, Yuichi
    Abdel-aal, Usama M.
    Misawa, Masashi
    Itoh, Hayato
    Oda, Masahiro
    Mori, Kensaku
    [J]. TRANSLATIONAL GASTROENTEROLOGY AND HEPATOLOGY, 2021, 6
  • [30] A dataset of laryngeal endoscopic images with comparative study on convolution neural network-based semantic segmentation
    Laves, Max-Heinrich
    Bicker, Jens
    Kahrs, Lueder A.
    Ortmaier, Tobias
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2019, 14 (03) : 483 - 492