ProMIL: A weakly supervised multiple instance learning for whole slide image classification based on class proxy

被引:7
作者
Li, Xiaoyu [1 ]
Yang, Bei [1 ]
Chen, Tiandong [2 ,3 ]
Gao, Zheng [1 ]
Huang, Mengjie [1 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Henan, Peoples R China
[2] Zhengzhou Univ, Affiliated Canc Hosp, Dept Pathol, Zhengzhou 450008, Henan, Peoples R China
[3] Henan Canc Hosp, Zhengzhou 450008, Henan, Peoples R China
关键词
Class proxy; Whole slide image; Metric learning; Multiple instance learning; Attention mechanism; TRANSFORMER;
D O I
10.1016/j.eswa.2023.121800
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The histopathological analysis of a suspected region is critical for cancer diagnosis, treatment, and manage-ment. Histopathological diagnosis consists in analyzing the characteristics of the lesions using tissue sections stained with hematoxylin and eosin. Classification of digital tumor pathology images, called whole slide images (WSIs), is a great challenge since WSIs usually have huge resolutions while lacking localized annotations. Multiple instance learning (MIL) is a commonly used method applied to pathological image analysis. However, most MIL methods often focus only on the global representation of WSIs, ignoring whether the category labels play other roles in the model training besides being a supervision signal. In addition, feature confusion is also a problem that should be avoided for the analysis of WSIs with weakly supervised methods. To address these problems, we propose a novel algorithm of classifying WSI for cancer diagnosis. The proposed model, ProMIL, uses only slide-level labels rather than localized annotations for analysis. There are three innovations in this work. Firstly, we present the concept of class proxy which is the representation of the intrinsic feature of each category, and plays a key role in guiding the training of the model. Secondly, we design a novel WSI representation learning module that utilizes a multi-scale feature extraction strategy to represent each patch in a WSI and then aggregates these representations using an attention mechanism to encode the WSI. Thirdly, we design a metric-learning-based weakly supervised multiclass-classifier by measuring the similarity between each WSI embedding and class proxies. The proposed ProMIL can effectively alleviate the side effect of feature confusion, and carry intuitive interpretability and scalability. To evaluate the performance of ProMIL, we conduct a series of experiments on several datasets of WSIs with different types of cancer from open data sources. It can be observed from the experimental results that ProMIL outperforms most of the compared methods and achieves better performance on a various type of cancer image data for classification, thus suggesting the proposed method is suitable for classifying different categories of cancer rather than a specific kind of cancer. Therefore, it is expected to act as a general framework to be extended to more cancer diagnoses.
引用
收藏
页数:12
相关论文
共 57 条
[1]   Breast Cancer Classification from Histopathological Images with Inception Recurrent Residual Convolutional Neural Network [J].
Alom, Md Zahangir ;
Yakopcic, Chris ;
Nasrin, Shamima ;
Taha, Tarek M. ;
Asari, Vijayan K. .
JOURNAL OF DIGITAL IMAGING, 2019, 32 (04) :605-617
[2]  
An J., 2022, INT C MED IMAG DEEP
[3]   C-Net: A reliable convolutional neural network for biomedical image classification [J].
Barzekar, Hosein ;
Yu, Zeyun .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
[4]   Clinical-grade computational pathology using weakly supervised deep learning on whole slide images [J].
Campanella, Gabriele ;
Hanna, Matthew G. ;
Geneslaw, Luke ;
Miraflor, Allen ;
Silva, Vitor Werneck Krauss ;
Busam, Klaus J. ;
Brogi, Edi ;
Reuter, Victor E. ;
Klimstra, David S. ;
Fuchs, Thomas J. .
NATURE MEDICINE, 2019, 25 (08) :1301-+
[5]  
Chen T, 2020, PR MACH LEARN RES, V119
[6]  
Chikontwe Philip, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12265), P519, DOI 10.1007/978-3-030-59722-1_50
[7]  
Courtiol P, 2020, Arxiv, DOI [arXiv:1802.02212, DOI 10.48550/ARXIV.1802.02212]
[8]   ESTIMATION BY NEAREST NEIGHBOR RULE [J].
COVER, TM .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1968, 14 (01) :50-+
[9]   Machine learning approaches in medical image analysis: From detection to diagnosis [J].
de Bruijne, Marleen .
MEDICAL IMAGE ANALYSIS, 2016, 33 :94-97
[10]   ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].
Deng, Jiankang ;
Guo, Jia ;
Xue, Niannan ;
Zafeiriou, Stefanos .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694