MuRCL: Multi-Instance Reinforcement Contrastive Learning for Whole Slide Image Classification

被引:20
作者
Zhu, Zhonghang [1 ]
Yu, Lequan [2 ]
Wu, Wei [3 ]
Yu, Rongshan [3 ]
Zhang, Defu [3 ]
Wang, Liansheng [3 ]
机构
[1] Xiamen Univ, Natl Inst Data Sci Hlth & Med, Sch Informat, Xiamen 361005, Peoples R China
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Peoples R China
[3] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
关键词
Feature extraction; Training; Task analysis; Semantics; Reinforcement learning; Computational modeling; Self-supervised learning; Whole slide image analysis; multi-instance learning; contrastive learning; reinforcement learning;
D O I
10.1109/TMI.2022.3227066
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Multi-instance learning (MIL) is widely adop- ted for automatic whole slide image (WSI) analysis and it usually consists of two stages, i.e., instance feature extraction and feature aggregation. However, due to the "weak supervision" of slide-level labels, the feature aggregation stage would suffer from severe over-fitting in training an effective MIL model. In this case, mining more information from limited slide-level data is pivotal to WSI analysis. Different from previous works on improving instance feature extraction, this paper investigates how to exploit the latent relationship of different instances (patches) to combat overfitting in MIL for more generalizable WSI classification. In particular, we propose a novel Multi-instance Rein- forcement Contrastive Learning framework (MuRCL) to deeply mine the inherent semantic relationships of different patches to advance WSI classification. Specifically, the proposed framework is first trained in a self-supervised manner and then finetuned with WSI slide-level labels. We formulate the first stage as a contrastive learning (CL) process, where positive/negative discriminative feature sets are constructed from the same patch-level feature bags of WSIs. To facilitate the CL training, we design a novel reinforcement learning-based agent to progressively update the selection of discriminative feature sets according to an online reward for slide-level feature aggregation. Then, we further update the model with labeled WSI data to regularize the learned features for the final WSI classification. Experimental results on three public WSI classification datasets (Camelyon16, TCGA-Lung and TCGA-Kidney) demonstrate that the proposed MuRCL outperforms state-of-the-art MIL models. In addition, MuRCL can achieve comparable performance to other state-of-the-art MIL models on TCGA-Esca dataset.
引用
收藏
页码:1337 / 1348
页数:12
相关论文
共 49 条
[1]  
Azizi S, 2021, Arxiv, DOI [arXiv:2101.05224, DOI 10.48550/ARXIV.2101.05224]
[2]  
Bachman P, 2019, Arxiv, DOI arXiv:1906.00910
[3]   Self-Supervised Learning for Cardiac MR Image Segmentation by Anatomical Position Prediction [J].
Bai, Wenjia ;
Chen, Chen ;
Tarroni, Giacomo ;
Duan, Jinming ;
Guitton, Florian ;
Petersen, Steffen E. ;
Guo, Yike ;
Matthews, Paul M. ;
Rueckert, Daniel .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 :541-549
[4]   How to Learn from Unlabeled Volume Data: Self-supervised 3D Context Feature Learning [J].
Blendowski, Maximilian ;
Nickisch, Hannes ;
Heinrich, Mattias P. .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT VI, 2019, 11769 :649-657
[5]   Clinical-grade computational pathology using weakly supervised deep learning on whole slide images [J].
Campanella, Gabriele ;
Hanna, Matthew G. ;
Geneslaw, Luke ;
Miraflor, Allen ;
Silva, Vitor Werneck Krauss ;
Busam, Klaus J. ;
Brogi, Edi ;
Reuter, Victor E. ;
Klimstra, David S. ;
Fuchs, Thomas J. .
NATURE MEDICINE, 2019, 25 (08) :1301-+
[6]  
Cao B, 2020, AAAI CONF ARTIF INTE, V34, P10486
[7]  
Chaitanya K, 2020, Arxiv, DOI arXiv:2006.10511
[8]   Self-supervised learning for medical image analysis using image context restoration [J].
Chen, Liang ;
Bentley, Paul ;
Mori, Kensaku ;
Misawa, Kazunari ;
Fujiwara, Michitaka ;
Rueckert, Daniel .
MEDICAL IMAGE ANALYSIS, 2019, 58
[9]   An augmented reality microscope with real-time artificial intelligence integration for cancer diagnosis [J].
Chen, Po-Hsuan Cameron ;
Gadepalli, Krishna ;
MacDonald, Robert ;
Liu, Yun ;
Kadowaki, Shiro ;
Nagpal, Kunal ;
Kohlberger, Timo ;
Dean, Jeffrey ;
Corrado, Greg S. ;
Hipp, Jason D. ;
Mermel, Craig H. ;
Stumpe, Martin C. .
NATURE MEDICINE, 2019, 25 (09) :1453-+
[10]   Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images [J].
Chen, Richard J. ;
Lu, Ming Y. ;
Weng, Wei-Hung ;
Chen, Tiffany Y. ;
Williamson, Drew F. K. ;
Manz, Trevor ;
Shady, Maha ;
Mahmood, Faisal .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :3995-4005