A Simple Framework for Depth-Augmented Contrastive Learning for Endoscopic Image Classification

被引：0

作者：

Weng, Weihao ^{[1
]}

Zhu, Xin ^{[2
]}

Cheikh, Faouzi Alaya ^{[3
]}

Ullah, Mohib ^{[3
]}

Imaizumi, Mitsuyoshi ^{[4
]}

Murono, Shigeyuki ^{[4
]}

Kubota, Satoshi ^{[4
]}

机构：

[1] Univ Aizu, Grad Sch Comp Sci & Engn, Aizu Wakamatsu, Fukushima 9658580, Japan

[2] Inst Tokyo, M&D Data Sci Ctr, Dept AI Technol Dev, Tokyo 1010062, Japan

[3] Norwegian Univ Sci & Technol, Dept Comp Sci, N-2815 Gjovik, Norway

[4] Fukushima Med Univ, Dept Otolaryngol, Fukushima 9601295, Japan

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2024年 / 73卷

基金：

日本学术振兴会;

关键词：

Estimation; Training; Accuracy; Endoscopes; Image classification; Contrastive learning; Testing; Three-dimensional displays; Pneumonia; Pharynx; deep learning; depth estimation; endoscopic image classification; self-supervised; semi-supervised;

D O I：

10.1109/TIM.2024.3470015

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This article introduces a simple framework for depth-augmented contrastive learning (SimDCL), a novel approach to enhance endoscopic image classification by incorporating depth information. Unlike traditional methods that struggle with the absence of depth in 2-D endoscopic images, SimDCL leverages a depth estimation technique trained exclusively on da Vinci Xi endoscope data. This method not only addresses the challenge of obtaining accurate depth data for regions like the pharynges or larynges but also presents the information in a manner that aligns with medical professionals' expertise. Specifically, we designed a loss function for self-supervised depth estimation (SSDE), which performs well when trained on public datasets and then applied to data without depth information. In addition, we developed an augmentation method and corresponding loss function that utilize this depth information to improve the accuracy of endoscopic image classification. The evaluation involved a private dataset of 199 flexible endoscopic evaluation of swallowing (FEES) video images for training and 40 independent FEES video images for testing, along with two public datasets (Nerthus and Kvasir). SimDCL achieved an accuracy of 73.0% (72.7% for Nerthus and 81.6% for Kvasir), surpassing the performance of existing methods (CCSSL, CoMatch, and FixMatch) by margins (9.2%, 12.1%, and 17.8% for FEES, 9.82%, 11.33%, and 11.67% for Nerthus, and 4.21%, 5.42%, and 9.97% for Kvasir, respectively).

引用

页数：12

共 50 条

[1] Domain-Collaborative Contrastive Learning for Hyperspectral Image Classification
Luo, Haiyang
Qiao, Xueyi
Xu, Yongming
Zhong, Shengwei
Gong, Chen
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
[2] Image classification framework based on contrastive self-supervised learning
Zhao H.-W.
Zhang J.-R.
Zhu J.-P.
Li H.
Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (08): : 1850 - 1856
[3] Prototypical contrastive learning for image classification
Yang, Han
Li, Jun
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (02): : 2059 - 2069
[4] Label contrastive learning for image classification
Han Yang
Jun Li
Soft Computing, 2023, 27 : 13477 - 13486
[5] Prototypical contrastive learning for image classification
Han Yang
Jun Li
Cluster Computing, 2024, 27 : 2059 - 2069
[6] Label contrastive learning for image classification
Yang, Han
Li, Jun
SOFT COMPUTING, 2023, 27 (18) : 13477 - 13486
[7] FACTUAL: A Novel Framework for Contrastive Learning Based Robust SAR Image Classification
Wang, Xu
Ye, Tian
Kannan, Rajgopal
Prasanna, Viktor
2024 IEEE RADAR CONFERENCE, RADARCONF 2024, 2024,
[8] SimCLRT: A Simple Framework for Contrastive Learning of Rumor Tracking
Zeng, Hui
Cui, Xiaohui
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110
[9] Self-Correlation Network With Triple Contrastive Learning for Hyperspectral Image Classification With Noisy Labels
Sarpong, Kwabena
Awrangjeb, Mohammad
Islam, Md. Saiful
Helmy, Islam
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 7166 - 7188
[10] A Framework Using Contrastive Learning for Classification with Noisy Labels
Ciortan, Madalina
Dupuis, Romain
Peel, Thomas
DATA, 2021, 6 (06)

← 1 2 3 4 5 →