Clustering-Guided Twin Contrastive Learning for Endomicroscopy Image Classification

被引:0
作者
Zhou, Jingjun [1 ]
Dong, Xiangjiang [2 ]
Liu, Qian [1 ,3 ]
机构
[1] Hainan Univ, Sch Biomed Engn, Haikou 570228, Peoples R China
[2] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China
[3] Hainan Univ, Sch Biomed Engn, Key Lab Biomed Engn Hainan Prov, Haikou 570228, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; contrastive learning; image classification and gastrointestinal; probe-based confocal laser endomicroscopy (pCLE); CONFOCAL LASER ENDOMICROSCOPY; SAFETY;
D O I
10.1109/JBHI.2024.3366223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning better representations is essential in medical image analysis for computer-aided diagnosis. However, learning discriminative semantic features is a major challenge due to the lack of large-scale well-annotated datasets. Thus, how can we learn a well-structured categorizable embedding space in limited-scale and unlabeled datasets? In this paper, we proposed a novel clustering-guided twin-contrastive learning framework (CTCL) that learns the discriminative representations of probe-based confocal laser endomicroscopy (pCLE) images for gastrointestinal (GI) tumor classification. Compared with traditional contrastive learning, in which only two randomly augmented views of the same instance are considered, the proposed CTCL aligns more semantically related and class-consistent samples by clustering, which improved intra-class tightness and inter-class variability to produce more informative representations. Furthermore, based on the inherent properties of CLE (geometric invariance and intrinsic noise), we proposed to regard CLE images with any angle rotation and CLE images with different noises as the same instance, respectively, for increased variability and diversity of samples. By optimizing CTCL in an end-to-end expectation-maximization framework, comprehensive experimental results demonstrated that CTCL-based visual representations achieved competitive performance on each downstream task as well as more robustness and transferability compared with existing state-of-the-art SSL and supervised methods. Notably, CTCL achieved 75.60%/78.45% and 64.12%/77.37% top-1 accuracy on the linear evaluation protocol and few-shot classification downstream tasks, respectively, which outperformed the previous best results by 1.27%/1.63% and 0.5%/3%, respectively. The proposed method holds great potential to assist pathologists in achieving an automated, fast, and high-precision diagnosis of GI tumors and accurately determining different stages of tumor development based on CLE images.
引用
收藏
页码:2879 / 2890
页数:12
相关论文
共 56 条
  • [1] ENDOMICROSCOPIC VIDEO RETRIEVAL USING MOSAICING AND VISUAL WORDS
    Andre, B.
    Vercauteren, T.
    Buchner, A. M.
    Wallace, M. B.
    Ayache, N.
    [J]. 2010 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, 2010, : 1419 - 1422
  • [2] ENDOMICROSCOPIC IMAGE RETRIEVAL AND CLASSIFICATION USING INVARIANT VISUAL FEATURES
    Andre, B.
    Vercauteren, T.
    Perchant, A.
    Buchner, A. M.
    Wallace, M. B.
    Ayache, N.
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, VOLS 1 AND 2, 2009, : 346 - +
  • [3] Learning Semantic and Visual Similarity for Endomicroscopy Video Retrieval
    Andre, Barbara
    Vercauteren, Tom
    Buchner, Anna M.
    Wallace, Michael B.
    Ayache, Nicholas
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2012, 31 (06) : 1276 - 1288
  • [4] André B, 2011, LECT NOTES COMPUT SC, V6893, P297, DOI 10.1007/978-3-642-23626-6_37
  • [5] Aubreville M., 2018, P 5 INT C BIOIM, P27
  • [6] Bachman P, 2019, ADV NEUR IN, V32
  • [7] Cao H., 2021, 2021 INT C NETW SYST, P181, DOI DOI 10.1109/INSAI54028.2021.00042
  • [8] Caron M, 2020, ADV NEUR IN, V33
  • [9] Deep Clustering for Unsupervised Learning of Visual Features
    Caron, Mathilde
    Bojanowski, Piotr
    Joulin, Armand
    Douze, Matthijs
    [J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 139 - 156
  • [10] Imaging breast cancer morphology using probe-based confocal laser endomicroscopy: towards a real-time intraoperative imaging tool for cavity scanning
    Chang, Tou Pin
    Leff, Daniel R.
    Shousha, Sami
    Hadjiminas, Dimitri J.
    Ramakrishnan, Rathi
    Hughes, Michael R.
    Yang, Guang-Zhong
    Darzi, Ara
    [J]. BREAST CANCER RESEARCH AND TREATMENT, 2015, 153 (02) : 299 - 310