A Simple Framework for Depth-Augmented Contrastive Learning for Endoscopic Image Classification

被引:0
|
作者
Weng, Weihao [1 ]
Zhu, Xin [2 ]
Cheikh, Faouzi Alaya [3 ]
Ullah, Mohib [3 ]
Imaizumi, Mitsuyoshi [4 ]
Murono, Shigeyuki [4 ]
Kubota, Satoshi [4 ]
机构
[1] Univ Aizu, Grad Sch Comp Sci & Engn, Aizu Wakamatsu, Fukushima 9658580, Japan
[2] Inst Tokyo, M&D Data Sci Ctr, Dept AI Technol Dev, Tokyo 1010062, Japan
[3] Norwegian Univ Sci & Technol, Dept Comp Sci, N-2815 Gjovik, Norway
[4] Fukushima Med Univ, Dept Otolaryngol, Fukushima 9601295, Japan
基金
日本学术振兴会;
关键词
Estimation; Training; Accuracy; Endoscopes; Image classification; Contrastive learning; Testing; Three-dimensional displays; Pneumonia; Pharynx; deep learning; depth estimation; endoscopic image classification; self-supervised; semi-supervised;
D O I
10.1109/TIM.2024.3470015
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article introduces a simple framework for depth-augmented contrastive learning (SimDCL), a novel approach to enhance endoscopic image classification by incorporating depth information. Unlike traditional methods that struggle with the absence of depth in 2-D endoscopic images, SimDCL leverages a depth estimation technique trained exclusively on da Vinci Xi endoscope data. This method not only addresses the challenge of obtaining accurate depth data for regions like the pharynges or larynges but also presents the information in a manner that aligns with medical professionals' expertise. Specifically, we designed a loss function for self-supervised depth estimation (SSDE), which performs well when trained on public datasets and then applied to data without depth information. In addition, we developed an augmentation method and corresponding loss function that utilize this depth information to improve the accuracy of endoscopic image classification. The evaluation involved a private dataset of 199 flexible endoscopic evaluation of swallowing (FEES) video images for training and 40 independent FEES video images for testing, along with two public datasets (Nerthus and Kvasir). SimDCL achieved an accuracy of 73.0% (72.7% for Nerthus and 81.6% for Kvasir), surpassing the performance of existing methods (CCSSL, CoMatch, and FixMatch) by margins (9.2%, 12.1%, and 17.8% for FEES, 9.82%, 11.33%, and 11.67% for Nerthus, and 4.21%, 5.42%, and 9.97% for Kvasir, respectively).
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Feature Contrastive Transfer Learning for Few-Shot Long-Tail Sonar Image Classification
    Bai, Zhongyu
    Xu, Hongli
    Ding, Qichuan
    Zhang, Xiangyue
    IEEE COMMUNICATIONS LETTERS, 2025, 29 (03) : 562 - 566
  • [32] Domain Fusion Contrastive Learning for Cross-Scene Hyperspectral Image Classification
    Qiu, Zhao
    Xu, Jie
    Peng, Jiangtao
    Sun, Weiwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [33] Contrastive prototype learning with semantic patchmix for few-shot image classification
    Dong, Mengping
    Lei, Fei
    Li, Zhenbo
    Liu, Xue
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 142
  • [34] MCTGCL: Mixed CNNTransformer for Mars Hyperspectral Image Classification With Graph Contrastive Learning
    Xi, Bobo
    Zhang, Yun
    Li, Jiaojiao
    Zheng, Tie
    Zhao, Xunfeng
    Xu, Haitao
    Xue, Changbin
    Li, Yunsong
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [35] FedCL: Federated contrastive learning for multi-center medical image classification
    Liu, Zhenbing
    Wu, Fengfeng
    Wang, Yumeng
    Yang, Mengyu
    Pan, Xipeng
    PATTERN RECOGNITION, 2023, 143
  • [36] Detection of Bone Metastases on Bone Scans through Image Classification with Contrastive Learning
    Hsieh, Te-Chun
    Liao, Chiung-Wei
    Lai, Yung-Chi
    Law, Kin-Man
    Chan, Pak-Ki
    Kao, Chia-Hung
    JOURNAL OF PERSONALIZED MEDICINE, 2021, 11 (12):
  • [37] A Unified Multiscale Learning Framework for Hyperspectral Image Classification
    Wang, Xue
    Tan, Kun
    Du, Peijun
    Pan, Chen
    Ding, Jianwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [38] Histopathology Image Classification Using Deep Manifold Contrastive Learning
    Tan, Jing Wei
    Jeong, Won-Ki
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 683 - 692
  • [39] CDARL: a contrastive discriminator-augmented reinforcement learning framework for sequential recommendations
    Zhuang Liu
    Yunpu Ma
    Marcel Hildebrandt
    Yuanxin Ouyang
    Zhang Xiong
    Knowledge and Information Systems, 2022, 64 : 2239 - 2265
  • [40] CONICA: A Contrastive Image Captioning Framework with Robust Similarity Learning
    Deng, Lin
    Zhong, Yuzhong
    Wang, Maoning
    Zhang, Jianwei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5109 - 5119