Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology

被引:105
作者
Laleh, Narmin Ghaffari [1 ]
Muti, Hannah Sophie [1 ]
Loeffler, Chiara Maria Lavinia [1 ]
Echle, Amelie [1 ]
Saldanha, Oliver Lester [1 ]
Mahmood, Faisal [2 ]
Lu, Ming Y. [2 ]
Trautwein, Christian [1 ]
Langer, Rupert [4 ]
Dislich, Bastian [3 ]
Buelow, Roman D. [5 ]
Grabsch, Heike Irmgard [6 ,7 ]
Brenner, Hermann [8 ,9 ,10 ]
Chang-Claude, Jenny [11 ,12 ]
Alwers, Elizabeth [8 ]
Brinker, Titus J. [13 ]
Khader, Firas [14 ]
Truhn, Daniel [14 ]
Gaisa, Nadine T. [5 ]
Boor, Peter [5 ]
Hoffmeister, Michael [8 ]
Schulz, Volkmar [15 ,16 ,17 ,18 ]
Kather, Jakob Nikolas [1 ,7 ,19 ]
机构
[1] Univ Hosp RWTH Aachen, Dept Med 3, Aachen, Germany
[2] Harvard Med Sch, Brigham & Womens Hosp, Dept Pathol, Boston, MA USA
[3] Univ Bern, Inst Pathol, Bern, Switzerland
[4] Johannes Kepler Univ Linz, Kepler Univ Hosp, Inst Pathol & Mol Pathol, Linz, Austria
[5] Univ Hosp RWTH Aachen, Inst Pathol, Aachen, Germany
[6] Maastricht Univ, GROW Sch Oncol & Dev Biol, Dept Pathol, Med Ctr, Maastricht, Netherlands
[7] Univ Leeds, Leeds Inst Med Res St Jamess, Div Pathol & Data Analyt, Leeds, England
[8] German Canc Res Ctr, Div Clin Epidemiol & Aging Res, Heidelberg, Germany
[9] German Canc Res Ctr, Div Prevent Oncol, Heidelberg, Germany
[10] German Canc Res Ctr, German Canc Consortium DKTK, Heidelberg, Germany
[11] German Canc Res Ctr, Div Canc Epidemiol, Heidelberg, Germany
[12] Univ Med Ctr Hamburg Eppendorf, Univ Canc Ctr Hamburg, Canc Epidemiol Grp, Hamburg, Germany
[13] German Canc Res Ctr, Digital Biomarkers Oncol Grp, Heidelberg, Germany
[14] Univ Hosp RWTH Aachen, Dept Radiol, Aachen, Germany
[15] Rhein Westfal TH Aachen, Dept Phys Mol Imaging Syst Expt Mol Imaging, Aachen, Germany
[16] Fraunhofer Inst Digital Med MEVIS, Bremen, Germany
[17] Univ Hosp Aachen, Comprehens Diagnost Ctr Aachen CDCA, Aachen, Germany
[18] Hyper Hybrid Imaging Syst GmbH, Aachen, Germany
[19] Tech Univ Dresden, Med Fac Carl Gustav Carus, Else Kroener Fresenius Ctr Digital Hlth, Dresden, Germany
基金
欧洲研究理事会;
关键词
Computational pathology; Artificial intelligence; Weakly-supervised deep learning; Vision transformers; Convolutional neural networks; Multiple-Instance Learning; COLORECTAL-CANCER; MICROSATELLITE INSTABILITY; PROSTATE-CANCER; NEURAL-NETWORK; COLONOSCOPY; PREDICTION; BIOPSIES;
D O I
10.1016/j.media.2022.102474
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Artificial intelligence (AI) can extract visual information from histopathological slides and yield biological insight and clinical biomarkers. Whole slide images are cut into thousands of tiles and classification problems are often weakly-supervised: the ground truth is only known for the slide, not for every single tile. In classical weakly-supervised analysis pipelines, all tiles inherit the slide label while in multiple-instance learning (MIL), only bags of tiles inherit the label. However, it is still unclear how these widely used but markedly different approaches perform relative to each other.We implemented and systematically compared six methods in six clinically relevant end-to-end prediction tasks using data from N = 2980 patients for training with rigorous external validation. We tested three classical weakly-supervised approaches with convolutional neural networks and vision transformers (ViT) and three MIL-based approaches with and without an additional attention module. Our results empirically demonstrate that histological tumor subtyping of renal cell carcinoma is an easy task in which all approaches achieve an area under the receiver operating curve (AUROC) of above 0.9. In contrast, we report significant performance differences for clinically relevant tasks of mutation prediction in colorectal, gastric, and bladder cancer. In these mutation prediction tasks, classical weakly-supervised workflows outperformed MIL-based weakly-supervised methods for mutation prediction, which is surprising given their simplicity. This shows that new end-to-end image analysis pipelines in computational pathology should be compared to classical weakly-supervised methods. Also, these findings motivate the development of new methods which combine the elegant assumptions of MIL with the empirically observed higher performance of classical weakly-supervised approaches. We make all source codes publicly available at https://github.com/KatherLab/HIA , allowing easy application of all methods to any similar task.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 79 条
[1]   External validation of molecular subtype classifications of colorectal cancer based on microsatellite instability, CIMP, BRAF and KRAS [J].
Alwers, Elizabeth ;
Blaeker, Hendrik ;
Walter, Viola ;
Jansen, Lina ;
Kloor, Matthias ;
Arnold, Alexander ;
Sieber-Frank, Julia ;
Herpel, Esther ;
Tagscherer, Katrin E. ;
Roth, Wilfried ;
Chang-Claude, Jenny ;
Brenner, Hermann ;
Hoffmeister, Michael .
BMC CANCER, 2019, 19 (1)
[2]  
[Anonymous], 2019, MOL TESTING STRATEGI
[3]   Comprehensive molecular characterization of gastric adenocarcinoma [J].
Bass, Adam J. ;
Thorsson, Vesteinn ;
Shmulevich, Ilya ;
Reynolds, Sheila M. ;
Miller, Michael ;
Bernard, Brady ;
Hinoue, Toshinori ;
Laird, Peter W. ;
Curtis, Christina ;
Shen, Hui ;
Weisenberger, Daniel J. ;
Schultz, Nikolaus ;
Shen, Ronglai ;
Weinhold, Nils ;
Keiser, David P. ;
Bowlby, Reanne ;
Sipahimalani, Payal ;
Cherniack, Andrew D. ;
Getz, Gad ;
Liu, Yingchun ;
Noble, Michael S. ;
Pedamallu, Chandra ;
Sougnez, Carrie ;
Taylor-Weiner, Amaro ;
Akbani, Rehan ;
Lee, Ju-Seog ;
Liu, Wenbin ;
Mills, Gordon B. ;
Yang, Da ;
Zhang, Wei ;
Pantazi, Angeliki ;
Parfenov, Michael ;
Gulley, Margaret ;
Piazuelo, M. Blanca ;
Schneider, Barbara G. ;
Kim, Jihun ;
Boussioutas, Alex ;
Sheth, Margi ;
Demchok, John A. ;
Rabkin, Charles S. ;
Willis, Joseph E. ;
Ng, Sam ;
Garman, Katherine ;
Beer, David G. ;
Pennathur, Arjun ;
Raphael, Benjamin J. ;
Wu, Hsin-Ta ;
Odze, Robert ;
Kim, Hark K. ;
Bowen, Jay .
NATURE, 2014, 513 (7517) :202-209
[4]   Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer [J].
Bejnordi, Babak Ehteshami ;
Veta, Mitko ;
van Diest, Paul Johannes ;
van Ginneken, Bram ;
Karssemeijer, Nico ;
Litjens, Geert ;
van der Laak, Jeroen A. W. M. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2017, 318 (22) :2199-2210
[5]  
Bengs M., MED ICAL IMAGING 202
[6]  
Berrada L., 2018, ARXIV CSLG
[7]   Cervical cancer detection in pap smear whole slide images using convNet with transfer learning and progressive resizing [J].
Bhatt, Anant R. ;
Ganatra, Amit ;
Kotecha, Ketan .
PEERJ COMPUTER SCIENCE, 2021, PeerJ Inc. (07) :1-18
[8]  
Bilal M., LANCET DIGITAL, V2021
[9]  
Brenner H, 2006, GUT, V55, P1145, DOI [10.1136/gut.2005087130, 10.1136/gut.2005.087130]
[10]   Protection From Colorectal Cancer After Colonoscopy A Population-Based, Case-Control Study [J].
Brenner, Hermann ;
Chang-Claude, Jenny ;
Seiler, Christoph M. ;
Rickert, Alexander ;
Hoffmeister, Michael .
ANNALS OF INTERNAL MEDICINE, 2011, 154 (01) :22-U156