Systematic comparison of deep learning strategies for weakly supervised Gleason grading

被引:4
作者
Otalora, Sebastian [1 ,2 ]
Atzori, Manfredo [2 ]
Khan, Amjad [2 ]
Jimenez-del-Toro, Oscar [1 ]
Andrearczyk, Vincent [2 ]
Mueller, Henning [1 ,2 ]
机构
[1] Univ Appl Sci Western Switzerland HES SO, Inst Informat Syst, Sierre, Switzerland
[2] Univ Geneva, Geneva, Switzerland
来源
MEDICAL IMAGING 2020: DIGITAL PATHOLOGY | 2021年 / 11320卷
基金
欧盟地平线“2020”;
关键词
Computational pathology; prostate cancer; deep learning; weak supervision;
D O I
10.1117/12.2548571
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Prostate cancer (PCa) is one of the most frequent cancers in men. Its grading is required before initiating its treatment. The Gleason Score (GS) aims at describing and measuring the regularity in gland patterns observed by a pathologist on the microscopic or digital images of prostate biopsies and prostatectomies. Deep Learning-based (DL) models are the state-of-the-art computer vision techniques for Gleason grading, learning high-level features with high classification power. However, for obtaining robust models with clinical-grade performance, a large number of local annotations are needed. Previous research showed that it is feasible to detect low and high-grade PCa from digitized tissue slides relying only on the less expensive report-level (weakly) supervised labels, thus global rather than local labels. Despite this, few articles focus on classifying the finer-grained GS classes with weakly supervised models. The objective of this paper is to compare weakly supervised strategies for classification of the five classes of the GS from the whole slide image, using the global diagnostic label from the pathology reports as the only source of supervision. We compare different models trained on hand-crafted features, shallow and deep learning representations. The training and evaluation are done on the publicly available TCGA-PRAD dataset, comprising of 341 whole slide images of radical prostatectomies, where small patches are extracted within tissue areas and assigned the global report label as ground truth. Our results show that DL networks and class-wise data augmentation outperform other strategies and their combinations, reaching a kappa score of kappa = 0.44, which could be further improved with a larger dataset or combining both strong and weakly supervised models.
引用
收藏
页数:8
相关论文
共 22 条
  • [1] [Anonymous], P 3 INT C LEARNING R
  • [2] Arvaniti E., 2018, MED IMAGING MEETS NE
  • [3] Automated Gleason grading of prostate cancer tissue microarrays via deep learning
    Arvaniti, Eirini
    Fricker, Kim S.
    Moret, Michael
    Rupp, Niels
    Hermanns, Thomas
    Fankhauser, Christian
    Wey, Norbert
    Wild, Peter J.
    Ruschoff, Jan H.
    Claassen, Manfred
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [4] Burlutskiy N, 2019, PR MACH LEARN RES, V102, P92
  • [5] Clinical-grade computational pathology using weakly supervised deep learning on whole slide images
    Campanella, Gabriele
    Hanna, Matthew G.
    Geneslaw, Luke
    Miraflor, Allen
    Silva, Vitor Werneck Krauss
    Busam, Klaus J.
    Brogi, Edi
    Reuter, Victor E.
    Klimstra, David S.
    Fuchs, Thomas J.
    [J]. NATURE MEDICINE, 2019, 25 (08) : 1301 - +
  • [6] Advances in the computational and molecular understanding of the prostate cancer cell nucleus
    Carleton, Neil M.
    Lee, George
    Madabhushi, Anant
    Veltri, Robert W.
    [J]. JOURNAL OF CELLULAR BIOCHEMISTRY, 2018, 119 (09) : 7127 - 7142
  • [7] Chang H, 2012, INT S BIOM IM
  • [8] Accurate and reproducible invasive breast cancer detection in whole-slide images: A Deep Learning approach for quantifying tumor extent
    Cruz-Roa, Angel
    Gilmore, Hannah
    Basavanhally, Ajay
    Feldman, Michael
    Ganesan, Shridar
    Shih, Natalie N. C.
    Tomaszewski, John
    Gonzalez, Fabio A.
    Madabhushi, Anant
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [9] de Bel T., 2020, LANCET ONCOL
  • [10] Doyle S, 2006, LECT NOTES COMPUT SC, V4191, P504