Clustering-Guided Twin Contrastive Learning for Endomicroscopy Image Classification

被引：0

作者：

Zhou, Jingjun ^{[1
]}

Dong, Xiangjiang ^{[2
]}

Liu, Qian ^{[1
,3
]}

机构：

[1] Hainan Univ, Sch Biomed Engn, Haikou 570228, Peoples R China

[2] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China

[3] Hainan Univ, Sch Biomed Engn, Key Lab Biomed Engn Hainan Prov, Haikou 570228, Peoples R China

来源：

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS | 2024年 / 28卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Clustering; contrastive learning; image classification and gastrointestinal; probe-based confocal laser endomicroscopy (pCLE); CONFOCAL LASER ENDOMICROSCOPY; SAFETY;

D O I：

10.1109/JBHI.2024.3366223

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning better representations is essential in medical image analysis for computer-aided diagnosis. However, learning discriminative semantic features is a major challenge due to the lack of large-scale well-annotated datasets. Thus, how can we learn a well-structured categorizable embedding space in limited-scale and unlabeled datasets? In this paper, we proposed a novel clustering-guided twin-contrastive learning framework (CTCL) that learns the discriminative representations of probe-based confocal laser endomicroscopy (pCLE) images for gastrointestinal (GI) tumor classification. Compared with traditional contrastive learning, in which only two randomly augmented views of the same instance are considered, the proposed CTCL aligns more semantically related and class-consistent samples by clustering, which improved intra-class tightness and inter-class variability to produce more informative representations. Furthermore, based on the inherent properties of CLE (geometric invariance and intrinsic noise), we proposed to regard CLE images with any angle rotation and CLE images with different noises as the same instance, respectively, for increased variability and diversity of samples. By optimizing CTCL in an end-to-end expectation-maximization framework, comprehensive experimental results demonstrated that CTCL-based visual representations achieved competitive performance on each downstream task as well as more robustness and transferability compared with existing state-of-the-art SSL and supervised methods. Notably, CTCL achieved 75.60%/78.45% and 64.12%/77.37% top-1 accuracy on the linear evaluation protocol and few-shot classification downstream tasks, respectively, which outperformed the previous best results by 1.27%/1.63% and 0.5%/3%, respectively. The proposed method holds great potential to assist pathologists in achieving an automated, fast, and high-precision diagnosis of GI tumors and accurately determining different stages of tumor development based on CLE images.

引用

页码：2879 / 2890

页数：12

共 56 条

[1] ENDOMICROSCOPIC VIDEO RETRIEVAL USING MOSAICING AND VISUAL WORDS
Andre, B.
Vercauteren, T.
Buchner, A. M.
Wallace, M. B.
Ayache, N.
[J]. 2010 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, 2010, : 1419 - 1422
[2] ENDOMICROSCOPIC IMAGE RETRIEVAL AND CLASSIFICATION USING INVARIANT VISUAL FEATURES
Andre, B.
Vercauteren, T.
Perchant, A.
Buchner, A. M.
Wallace, M. B.
Ayache, N.
[J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, VOLS 1 AND 2, 2009, : 346 - +
[3] Learning Semantic and Visual Similarity for Endomicroscopy Video Retrieval
Andre, Barbara
Vercauteren, Tom
Buchner, Anna M.
Wallace, Michael B.
Ayache, Nicholas
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2012, 31 (06) : 1276 - 1288
[4] André B, 2011, LECT NOTES COMPUT SC, V6893, P297, DOI 10.1007/978-3-642-23626-6_37
[5] Aubreville M., 2018, P 5 INT C BIOIM, P27
[6] Bachman P, 2019, ADV NEUR IN, V32
[7] Cao H., 2021, 2021 INT C NETW SYST, P181, DOI DOI 10.1109/INSAI54028.2021.00042
[8] Caron M, 2020, ADV NEUR IN, V33
[9] Deep Clustering for Unsupervised Learning of Visual Features
Caron, Mathilde
Bojanowski, Piotr
Joulin, Armand
Douze, Matthijs
[J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 139 - 156
[10] Imaging breast cancer morphology using probe-based confocal laser endomicroscopy: towards a real-time intraoperative imaging tool for cavity scanning
Chang, Tou Pin
Leff, Daniel R.
Shousha, Sami
Hadjiminas, Dimitri J.
Ramakrishnan, Rathi
Hughes, Michael R.
Yang, Guang-Zhong
Darzi, Ara
[J]. BREAST CANCER RESEARCH AND TREATMENT, 2015, 153 (02) : 299 - 310

← 1 2 3 4 5 6 →