A Simple Framework for Depth-Augmented Contrastive Learning for Endoscopic Image Classification

被引:0
|
作者
Weng, Weihao [1 ]
Zhu, Xin [2 ]
Cheikh, Faouzi Alaya [3 ]
Ullah, Mohib [3 ]
Imaizumi, Mitsuyoshi [4 ]
Murono, Shigeyuki [4 ]
Kubota, Satoshi [4 ]
机构
[1] Univ Aizu, Grad Sch Comp Sci & Engn, Aizu Wakamatsu, Fukushima 9658580, Japan
[2] Inst Tokyo, M&D Data Sci Ctr, Dept AI Technol Dev, Tokyo 1010062, Japan
[3] Norwegian Univ Sci & Technol, Dept Comp Sci, N-2815 Gjovik, Norway
[4] Fukushima Med Univ, Dept Otolaryngol, Fukushima 9601295, Japan
基金
日本学术振兴会;
关键词
Estimation; Training; Accuracy; Endoscopes; Image classification; Contrastive learning; Testing; Three-dimensional displays; Pneumonia; Pharynx; deep learning; depth estimation; endoscopic image classification; self-supervised; semi-supervised;
D O I
10.1109/TIM.2024.3470015
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article introduces a simple framework for depth-augmented contrastive learning (SimDCL), a novel approach to enhance endoscopic image classification by incorporating depth information. Unlike traditional methods that struggle with the absence of depth in 2-D endoscopic images, SimDCL leverages a depth estimation technique trained exclusively on da Vinci Xi endoscope data. This method not only addresses the challenge of obtaining accurate depth data for regions like the pharynges or larynges but also presents the information in a manner that aligns with medical professionals' expertise. Specifically, we designed a loss function for self-supervised depth estimation (SSDE), which performs well when trained on public datasets and then applied to data without depth information. In addition, we developed an augmentation method and corresponding loss function that utilize this depth information to improve the accuracy of endoscopic image classification. The evaluation involved a private dataset of 199 flexible endoscopic evaluation of swallowing (FEES) video images for training and 40 independent FEES video images for testing, along with two public datasets (Nerthus and Kvasir). SimDCL achieved an accuracy of 73.0% (72.7% for Nerthus and 81.6% for Kvasir), surpassing the performance of existing methods (CCSSL, CoMatch, and FixMatch) by margins (9.2%, 12.1%, and 17.8% for FEES, 9.82%, 11.33%, and 11.67% for Nerthus, and 4.21%, 5.42%, and 9.97% for Kvasir, respectively).
引用
收藏
页数:12
相关论文
共 50 条
  • [41] PROTOTYPE GOVERNED CONTRASTIVE LEARNING FOR ROBUST IMAGE CLASSIFICATION IN HISTOPATHOLOGY
    Tinaikar, Aashay
    Raipuria, Geetank
    Singhal, Nitin
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [42] Cross-Modality Contrastive Learning for Hyperspectral Image Classification
    Hang, Renlong
    Qian, Xuwei
    Liu, Qingshan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [43] CDARL: a contrastive discriminator-augmented reinforcement learning framework for sequential recommendations
    Liu, Zhuang
    Ma, Yunpu
    Hildebrandt, Marcel
    Ouyang, Yuanxin
    Xiong, Zhang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (08) : 2239 - 2265
  • [44] Adversarial Domain Alignment With Contrastive Learning for Hyperspectral Image Classification
    Liu, Fang
    Gao, Wenfei
    Liu, Jia
    Tang, Xu
    Xiao, Liang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [45] Supervised Contrastive Learning-Based Classification for Hyperspectral Image
    Huang, Lingbo
    Chen, Yushi
    He, Xin
    Ghamisi, Pedram
    REMOTE SENSING, 2022, 14 (21)
  • [46] Renal Pathological Image Classification Based on Contrastive and Transfer Learning
    Liu, Xinkai
    Zhu, Xin
    Tian, Xingjian
    Iwasaki, Tsuyoshi
    Sato, Atsuya
    Kazama, Junichiro James
    ELECTRONICS, 2024, 13 (07)
  • [47] SPATIAL-SPECTRAL CONTRASTIVE LEARNING FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Guan, Peiyan
    Lam, Edmund Y.
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1372 - 1375
  • [48] MuRCL: Multi-Instance Reinforcement Contrastive Learning for Whole Slide Image Classification
    Zhu, Zhonghang
    Yu, Lequan
    Wu, Wei
    Yu, Rongshan
    Zhang, Defu
    Wang, Liansheng
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (05) : 1337 - 1348
  • [49] Polarimetry-Inspired Contrastive Learning for Class-Imbalanced PolSAR Image Classification
    Kuang, Zuzheng
    Bi, Haixia
    Li, Fan
    Xu, Chen
    Sun, Jian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 19
  • [50] Spectral-Spatial Masked Transformer With Supervised and Contrastive Learning for Hyperspectral Image Classification
    Huang, Lingbo
    Chen, Yushi
    He, Xin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61