Systematic comparison of deep learning strategies for weakly supervised Gleason grading

被引：4

作者：

Otalora, Sebastian ^{[1
,2
]}

Atzori, Manfredo ^{[2
]}

Khan, Amjad ^{[2
]}

Jimenez-del-Toro, Oscar ^{[1
]}

Andrearczyk, Vincent ^{[2
]}

Mueller, Henning ^{[1
,2
]}

机构：

[1] Univ Appl Sci Western Switzerland HES SO, Inst Informat Syst, Sierre, Switzerland

[2] Univ Geneva, Geneva, Switzerland

来源：

MEDICAL IMAGING 2020: DIGITAL PATHOLOGY | 2021年 / 11320卷

基金：

欧盟地平线“2020”;

关键词：

Computational pathology; prostate cancer; deep learning; weak supervision;

D O I：

10.1117/12.2548571

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Prostate cancer (PCa) is one of the most frequent cancers in men. Its grading is required before initiating its treatment. The Gleason Score (GS) aims at describing and measuring the regularity in gland patterns observed by a pathologist on the microscopic or digital images of prostate biopsies and prostatectomies. Deep Learning-based (DL) models are the state-of-the-art computer vision techniques for Gleason grading, learning high-level features with high classification power. However, for obtaining robust models with clinical-grade performance, a large number of local annotations are needed. Previous research showed that it is feasible to detect low and high-grade PCa from digitized tissue slides relying only on the less expensive report-level (weakly) supervised labels, thus global rather than local labels. Despite this, few articles focus on classifying the finer-grained GS classes with weakly supervised models. The objective of this paper is to compare weakly supervised strategies for classification of the five classes of the GS from the whole slide image, using the global diagnostic label from the pathology reports as the only source of supervision. We compare different models trained on hand-crafted features, shallow and deep learning representations. The training and evaluation are done on the publicly available TCGA-PRAD dataset, comprising of 341 whole slide images of radical prostatectomies, where small patches are extracted within tissue areas and assigned the global report label as ground truth. Our results show that DL networks and class-wise data augmentation outperform other strategies and their combinations, reaching a kappa score of kappa = 0.44, which could be further improved with a larger dataset or combining both strong and weakly supervised models.

引用

页数：8

共 22 条

[1] [Anonymous], P 3 INT C LEARNING R
[2] Arvaniti E., 2018, MED IMAGING MEETS NE
[3] Automated Gleason grading of prostate cancer tissue microarrays via deep learning
Arvaniti, Eirini
Fricker, Kim S.
Moret, Michael
Rupp, Niels
Hermanns, Thomas
Fankhauser, Christian
Wey, Norbert
Wild, Peter J.
Ruschoff, Jan H.
Claassen, Manfred
[J]. SCIENTIFIC REPORTS, 2018, 8
[4] Burlutskiy N, 2019, PR MACH LEARN RES, V102, P92
[5] Clinical-grade computational pathology using weakly supervised deep learning on whole slide images
Campanella, Gabriele
Hanna, Matthew G.
Geneslaw, Luke
Miraflor, Allen
Silva, Vitor Werneck Krauss
Busam, Klaus J.
Brogi, Edi
Reuter, Victor E.
Klimstra, David S.
Fuchs, Thomas J.
[J]. NATURE MEDICINE, 2019, 25 (08) : 1301 - +
[6] Advances in the computational and molecular understanding of the prostate cancer cell nucleus
Carleton, Neil M.
Lee, George
Madabhushi, Anant
Veltri, Robert W.
[J]. JOURNAL OF CELLULAR BIOCHEMISTRY, 2018, 119 (09) : 7127 - 7142
[7] Chang H, 2012, INT S BIOM IM
[8] Accurate and reproducible invasive breast cancer detection in whole-slide images: A Deep Learning approach for quantifying tumor extent
Cruz-Roa, Angel
Gilmore, Hannah
Basavanhally, Ajay
Feldman, Michael
Ganesan, Shridar
Shih, Natalie N. C.
Tomaszewski, John
Gonzalez, Fabio A.
Madabhushi, Anant
[J]. SCIENTIFIC REPORTS, 2017, 7
[9] de Bel T., 2020, LANCET ONCOL
[10] Doyle S, 2006, LECT NOTES COMPUT SC, V4191, P504

← 1 2 3 →