Prototypical multiple instance learning for predicting lymph node metastasis of breast cancer from whole-slide pathological images

被引:32
作者
Yu, Jin-Gang [1 ,2 ]
Wu, Zihao [1 ]
Ming, Yu [1 ]
Deng, Shule [1 ]
Li, Yuanqing [1 ,2 ]
Ou, Caifeng [3 ]
He, Chunjiang [3 ]
Wang, Baiye [4 ]
Zhang, Pusheng [3 ]
Wang, Yu [5 ]
机构
[1] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China
[2] Pazhou Lab, Guangzhou 510335, Peoples R China
[3] Southern Med Univ, Zhujiang Hosp, Dept Breast Surg, Guangzhou 510280, Peoples R China
[4] Southern Med Univ, Zhujiang Hosp, Dept Radiol, Guangzhou 510280, Peoples R China
[5] Southern Med Univ, Zhujiang Hosp, Dept Pathol, Guangzhou 510280, Peoples R China
基金
中国国家自然科学基金;
关键词
Computational pathology; Whole-slide images; Lymph node metastasis; Breast cancer; Prototypical multiple instance learning; BIOPSY;
D O I
10.1016/j.media.2023.102748
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computerized identification of lymph node metastasis of breast cancer (BCLNM) from whole-slide pathological images (WSIs) can largely benefit therapy decision and prognosis analysis. Besides the general challenges of computational pathology, like extra-high resolution, very expensive fine-grained annotation, etc., two particular difficulties with this task lie in (1) modeling the significant inter-tumoral heterogeneity in BCLNM pathological images, and (2) identifying micro-metastases, i.e., metastasized tumors with tiny foci. Towards this end, this paper presents a novel weakly supervised method, termed as Prototypical Multiple Instance Learning (PMIL), to learn to predict BCLNM from WSIs with slide-level class labels only. PMIL introduces the well -established vocabulary-based multiple instance learning (MIL) paradigm into computational pathology, which is characterized by utilizing the so-called prototypes to model pathological data and construct WSI features. PMIL mainly consists of two innovatively designed modules, i.e., the prototype discovery module which acquires prototypes from training data by unsupervised clustering, and the prototype-based slide embedding module which builds WSI features by matching constitutive patches against the prototypes. Relative to existing MIL methods for WSI classification, PMIL has two substantial merits: (1) being more explicit and interpretable in modeling the inter-tumoral heterogeneity in BCLNM pathological images, and (2) being more effective in identifying micro-metastases. Evaluation is conducted on two datasets, i.e., the public Camelyon16 dataset and the Zbraln dataset created by ourselves. PMIL achieves an AUC of 88.2% on Camelyon16 and 98.4% on Zbraln (at 40x magnification factor), which consistently outperforms other compared methods. Comprehensive analysis will also be carried out to further reveal the effectiveness and merits of the proposed method.
引用
收藏
页数:12
相关论文
共 47 条
[1]   Computational pathology definitions, best practices, and recommendations for regulatory guidance: a white paper from the Digital Pathology Association [J].
Abels, Esther ;
Pantanowitz, Liron ;
Aeffner, Famke ;
Zarella, Mark D. ;
van der Laak, Jeroen ;
Bui, Marilyn M. ;
Vemuri, Venkata N. P. ;
Parwani, Anil V. ;
Gibbs, Jeff ;
Agosto-Arroyo, Emmanuel ;
Beck, Andrew H. ;
Kozlowski, Cleopatra .
JOURNAL OF PATHOLOGY, 2019, 249 (03) :286-294
[2]   Novel techniques for sentinel lymph node biopsy in breast cancer: a systematic review [J].
Ahmed, Muneer ;
Purushotham, Arnie D. ;
Douek, Michael .
LANCET ONCOLOGY, 2014, 15 (08) :E351-E362
[3]   Role of sonography in the diagnosis of axillary lymph node metastases in breast cancer:: A systematic review [J].
Alvarez, S ;
Añorbe, E ;
Alcorta, P ;
López, F ;
Alonso, I ;
Cortés, J .
AMERICAN JOURNAL OF ROENTGENOLOGY, 2006, 186 (05) :1342-1348
[4]   Multiple instance classification: Review, taxonomy and comparative study [J].
Amores, Jaume .
ARTIFICIAL INTELLIGENCE, 2013, 201 :81-105
[5]   From Detection of Individual Metastases to Classification of Lymph Node Status at the Patient Level: The CAMELYON17 Challenge [J].
Bandi, Peter ;
Geessink, Oscar ;
Manson, Quirine ;
van Dijk, Marcory ;
Balkenhol, Maschenka ;
Hermsen, Meyke ;
Bejnordi, Babak Ehteshami ;
Lee, Byungjae ;
Paeng, Kyunghyun ;
Zhong, Aoxiao ;
Li, Quanzheng ;
Zanjani, Farhad Ghazvinian ;
Zinger, Svitlana ;
Fukuta, Keisuke ;
Komura, Daisuke ;
Ovtcharov, Vlado ;
Cheng, Shenghua ;
Zeng, Shaoqun ;
Thagaard, Jeppe ;
Dahl, Anders B. ;
Lin, Huangjing ;
Chen, Hao ;
Jacobsson, Ludwig ;
Hedlund, Martin ;
Cetin, Melih ;
Halici, Eren ;
Jackson, Hunter ;
Chen, Richard ;
Both, Fabian ;
Franke, Joerg ;
Kusters-Vandevelde, Heidi ;
Vreuls, Willem ;
Bult, Peter ;
van Ginneken, Bram ;
van der Laak, Jeroen ;
Litjens, Geert .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (02) :550-560
[6]   Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer [J].
Bejnordi, Babak Ehteshami ;
Veta, Mitko ;
van Diest, Paul Johannes ;
van Ginneken, Bram ;
Karssemeijer, Nico ;
Litjens, Geert ;
van der Laak, Jeroen A. W. M. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2017, 318 (22) :2199-2210
[7]  
Caicedo JC, 2009, LECT NOTES ARTIF INT, V5651, P126, DOI 10.1007/978-3-642-02976-9_17
[8]   Clinical-grade computational pathology using weakly supervised deep learning on whole slide images [J].
Campanella, Gabriele ;
Hanna, Matthew G. ;
Geneslaw, Luke ;
Miraflor, Allen ;
Silva, Vitor Werneck Krauss ;
Busam, Klaus J. ;
Brogi, Edi ;
Reuter, Victor E. ;
Klimstra, David S. ;
Fuchs, Thomas J. .
NATURE MEDICINE, 2019, 25 (08) :1301-+
[9]   Multiple instance learning: A survey of problem characteristics and applications [J].
Carbonneau, Marc-Andre ;
Cheplygina, Veronika ;
Granger, Eric ;
Gagnon, Ghyslain .
PATTERN RECOGNITION, 2018, 77 :329-353
[10]  
Chen C. -L., 2021, Nature Commun., V12, P113