Classification of endoscopic image and video frames using distance metric-based learning with interpolated latent features

被引:0
作者
Fatemeh Sedighipour Chafjiri
Mohammad Reza Mohebbian
Khan A. Wahid
Paul Babyn
机构
[1] University of Saskatchewan,Department of Electrical and Computer Engineering
[2] University of Saskatchewan and Saskatchewan Health Authority,Department of Medical Imaging
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Endoscopy; Few shot learning; Manifold mix-up; Siamese neural network; Classification; GI track anatomic locations;
D O I
暂无
中图分类号
学科分类号
摘要
Conventional Endoscopy (CE) and Wireless Capsule Endoscopy (WCE) are well known tools for diagnosing gastrointestinal (GI) tract related disorders. Defining the anatomical location within the GI tract helps clinicians determine appropriate treatment options, which can reduce the need for repetitive endoscopy. Limited research addresses the localization of the anatomical location of WCE and CE images using classification, mainly due to the difficulty in collecting annotated data. In this study, we present a few-shot learning method based on distance metric learning which combines transfer-learning and manifold mixup schemes to localize and classify endoscopic images and video frames. The proposed method allows us to develop a pipeline for endoscopy video sequence localization that can be trained with only a few samples. The use of manifold mixup improves learning by increasing the number of training epochs while reducing overfitting and providing more accurate decision boundaries. A dataset is collected from 10 different anatomical positions of the human GI tract. Two models were trained using only 78 CE and 27 WCE annotated frames to predict the location of 25,700 and 1825 video frames from CE and WCE respectively. We performed subjective evaluation using nine gastroenterologists to validate the need of having such an automated system to localize endoscopic images and video frames. Our method achieved higher accuracy and a higher F1-score when compared with the scores from subjective evaluation. In addition, the results show improved performance with less cross-entropy loss when compared with several existing methods trained on the same datasets. This indicates that the proposed method has the potential to be used in endoscopy image classification.
引用
收藏
页码:36577 / 36598
页数:21
相关论文
共 114 条
  • [11] Säckinger E(2016)Matching networks for one shot learning Adv Neural Inf Process Syst 29 3630-1072
  • [12] Shah R(2010)A survey on transfer learning IEEE Trans Knowl Data Eng 22 1345-3472
  • [13] Ciuti G(2021)Discriminative ensemble learning for few-shot chest x-ray diagnosis Med Image Anal 68 101911-105
  • [14] Menciassi A(2014)Feasibility of capsule endoscopy for direct imaging of drug delivery systems in the fasted upper-gastrointestinal tract Pharm Res 31 2044-8
  • [15] Dario P(2001)A comparison of endoscopic ultrasound, magnetic resonance imaging, and exam under anesthesia for evaluation of Crohn’s perianal fistulas Gastroenterology 121 1064-2399
  • [16] Cunha JPS(2019)A novel passive magnetic localization wearable system for wireless capsule endoscopy IEEE Sensors J 19 3462-3547
  • [17] Coimbra M(2012)Wireless capsule endoscopy video segmentation using an unsupervised learning approach based on probabilistic latent semantic analysis with scale invariant features IEEE Trans Inf Technol Biomed 16 98-34
  • [18] Campos P(2018)Automatic anatomical classification of esophagogastroduodenoscopy images using deep convolutional neural networks Sci Rep 8 1-650
  • [19] Soares JM(2012)A review of localization systems for robotic endoscopic capsules IEEE Trans Biomed Eng 59 2387-238
  • [20] Lewis BS(2020)Generalized support vector data description for anomaly detection Pattern Recogn 100 107119-415