Critical evaluation of artificial intelligence as a digital twin of pathologists for prostate cancer pathology

被引:0
作者
Okyaz Eminaga
Mahmoud Abbas
Christian Kunder
Yuri Tolkach
Ryan Han
James D. Brooks
Rosalie Nolley
Axel Semjonow
Martin Boegemann
Robert West
Jin Long
Richard E. Fan
Olaf Bettendorf
机构
[1] AI Vobis,Department of Pathology, Prostate Center
[2] University Hospital Muenster,Department of Pathology
[3] Stanford University School of Medicine,Department of Pathology
[4] Cologne University Hospital,Department of Computer Science
[5] Stanford University,Department of Urology
[6] Stanford University School of Medicine,Department of Urology, Prostate Center
[7] University Hospital Muenster,Department of Pediatrics
[8] Stanford University School of Medicine,undefined
[9] Institute for Pathology and Cytology,undefined
来源
Scientific Reports | / 14卷
关键词
Artificial intelligence; Prostate cancer; Gleason grading system; ISUP; Deep learning; Automation; Stress tests; Digital twin; Pathology;
D O I
暂无
中图分类号
学科分类号
摘要
Prostate cancer pathology plays a crucial role in clinical management but is time-consuming. Artificial intelligence (AI) shows promise in detecting prostate cancer and grading patterns. We tested an AI-based digital twin of a pathologist, vPatho, on 2603 histological images of prostate tissue stained with hematoxylin and eosin. We analyzed various factors influencing tumor grade discordance between the vPatho system and six human pathologists. Our results demonstrated that vPatho achieved comparable performance in prostate cancer detection and tumor volume estimation, as reported in the literature. The concordance levels between vPatho and human pathologists were examined. Notably, moderate to substantial agreement was observed in identifying complementary histological features such as ductal, cribriform, nerve, blood vessel, and lymphocyte infiltration. However, concordance in tumor grading decreased when applied to prostatectomy specimens (κ = 0.44) compared to biopsy cores (κ = 0.70). Adjusting the decision threshold for the secondary Gleason pattern from 5 to 10% improved the concordance level between pathologists and vPatho for tumor grading on prostatectomy specimens (κ from 0.44 to 0.64). Potential causes of grade discordance included the vertical extent of tumors toward the prostate boundary and the proportions of slides with prostate cancer. Gleason pattern 4 was particularly associated with this population. Notably, the grade according to vPatho was not specific to any of the six pathologists involved in routine clinical grading. In conclusion, our study highlights the potential utility of AI in developing a digital twin for a pathologist. This approach can help uncover limitations in AI adoption and the practical application of the current grading system for prostate cancer pathology.
引用
收藏
相关论文
共 91 条
  • [1] Jemal A(2010)Cancer statistics, 2010 CA Cancer J. Clin. 60 277-300
  • [2] Siegel R(2011)A competing-risks analysis of survival after alternative treatment modalities for prostate cancer patients: 1988–2006 Eur. Urol. 59 88-95
  • [3] Xu J(2008)Recommendations for the reporting of prostate carcinoma: Association of directors of anatomic and surgical pathology Am. J. Clin. Pathol. 129 24-30
  • [4] Ward E(2010)Clinical map document based on XML (cMDX): Document architecture with mapping feature for reporting and analysing prostate cancer in radical prostatectomy specimens BMC Med. Inform. Decis. Mak 10 71-124
  • [5] Abdollah F(2012)Handling of radical prostatectomy specimens Histopathology 60 118-133
  • [6] Epstein JI(2018)Automated Gleason grading of prostate cancer tissue microarrays via deep learning Sci. Rep. 8 12054-241
  • [7] Srigley J(2019)Author correction: Automated Gleason grading of prostate cancer tissue microarrays via deep learning Sci. Rep. 9 7668-232
  • [8] Grignon D(2018)An EM-based semi-supervised deep learning approach for semantic segmentation of histopathological images from radical prostatectomies Comput. Med. Imaging Graph. 69 125-1380
  • [9] Humphrey P(2019)Clinical-grade computational pathology using weakly supervised deep learning on whole slide images Nat. Med. 2 48-671
  • [10] Eminaga O(2019)Development and validation of a deep learning algorithm for improving Gleason scoring of prostate cancer npj Digit. Med. 21 233-116