Development and Validation of an Artificial Intelligence-Powered Platform for Prostate Cancer Grading and Quantification

Cited by: 25
Authors
Huang, Wei [1 ,2 ]
Randhawa, Ramandeep [2 ,3 ]
Jain, Parag [2 ]
Iczkowski, Kenneth A. [4 ]
Hu, Rong [1 ]
Hubbard, Samuel [1 ]
Eickhoff, Jens [5 ]
Basu, Hirak [6 ]
Roy, Rajat [2 ]
Affiliations
[1] Univ Wisconsin, Sch Med & Publ Hlth, Dept Pathol & Lab Med, 1111 Highland Ave, Madison, WI 53705 USA
[2] PathomIQ, Silicon Valley, CA USA
[3] Univ Southern Calif, Marshall Sch Business, Los Angeles, CA 90007 USA
[4] Med Coll Wisconsin, Dept Pathol, Milwaukee, WI 53226 USA
[5] Univ Wisconsin, Dept Biostat & Informat, Madison, WI 53706 USA
[6] Univ Texas MD Anderson Canc Ctr, Univ Texas Hlth Sci Ctr Houston, Dept Genitourinary Med Oncol, Houston, TX 77030 USA
Keywords
ISUP CONSENSUS-CONFERENCE; INTEROBSERVER REPRODUCIBILITY; INTERNATIONAL SOCIETY; CLINICAL STAGE; BIOPSIES; CARCINOMA; DIAGNOSIS; ADENOCARCINOMA; UTILITY; UPDATE;
DOI
10.1001/jamanetworkopen.2021.32554
Chinese Library Classification (CLC) number
R5 [Internal Medicine];
Discipline classification code
1002 ; 100201 ;
Abstract
IMPORTANCE The Gleason grading system has been the most reliable tool for the prognosis of prostate cancer since its development. However, its clinical application remains limited by interobserver variability in grading and quantification, which has negative consequences for risk assessment and clinical management of prostate cancer.
OBJECTIVE To examine the impact of an artificial intelligence (AI)-assisted approach to prostate cancer grading and quantification.
DESIGN, SETTING, AND PARTICIPANTS This diagnostic study was conducted at the University of Wisconsin-Madison from August 2, 2017, to December 30, 2019. The study chronologically selected 589 men with biopsy-confirmed prostate cancer who received care in the University of Wisconsin Health System between January 1, 2005, and February 28, 2017. A total of 1000 biopsy slides (1 or 2 slides per patient) were selected and scanned to create digital whole-slide images, which were used to develop and validate a deep convolutional neural network-based AI-powered platform. The whole-slide images were divided into a training set (n = 838) and validation set (n = 162). Three experienced academic urological pathologists (W.H., K.A.I., and R.H., hereinafter referred to as pathologists 1, 2, and 3, respectively) were involved in the validation. Data were collected between December 29, 2018, and December 20, 2019, and analyzed from January 4, 2020, to March 1, 2021.
MAIN OUTCOMES AND MEASURES Accuracy of prostate cancer detection by the AI-powered platform and comparison of prostate cancer grading and quantification performed by the 3 pathologists using manual vs AI-assisted methods.
RESULTS Among 589 men with biopsy slides, the mean (SD) age was 63.8 (8.2) years, the mean (SD) prebiopsy prostate-specific antigen level was 10.2 (16.2) ng/mL, and the mean (SD) total cancer volume was 15.4% (20.1%). The AI system was able to distinguish prostate cancer from benign prostatic epithelium and stroma with high accuracy at the patch-pixel level, with an area under the receiver operating characteristic curve of 0.92 (95% CI, 0.88-0.95). The AI system achieved almost perfect agreement with the training pathologist (pathologist 1) in detecting prostate cancer at the patch-pixel level (weighted kappa = 0.97; asymptotic 95% CI, 0.96-0.98) and in grading prostate cancer at the slide level (weighted kappa = 0.98; asymptotic 95% CI, 0.96-1.00). Use of the AI-assisted method was associated with significant improvements in the concordance of prostate cancer grading and quantification between the 3 pathologists (eg, pathologists 1 and 2: 90.1% agreement using AI-assisted method vs 84.0% agreement using manual method; P < .001) and significantly higher weighted kappa values for all pathologists (eg, pathologists 2 and 3: weighted kappa = 0.92 [asymptotic 95% CI, 0.90-0.94] for AI-assisted method vs 0.76 [asymptotic 95% CI, 0.71-0.80] for manual method; P < .001) compared with the manual method.
CONCLUSIONS AND RELEVANCE In this diagnostic study, an AI-powered platform was able to detect, grade, and quantify prostate cancer with high accuracy and efficiency and was associated with significant reductions in interobserver variability. These results suggest that an AI-powered platform could potentially transform histopathological evaluation and improve risk stratification and clinical management of prostate cancer.
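The abstract reports interobserver agreement as weighted kappa. As an illustration only, the following minimal Python sketch shows how Cohen's weighted kappa can be computed for two raters over ordered grade categories. The abstract does not state which weighting scheme the study used, so the default of linear weights is an assumption, and the function name `weighted_kappa` and the sample ratings are hypothetical and not taken from the study.

```python
from collections import Counter


def weighted_kappa(rater_a, rater_b, categories, weighting="linear"):
    """Cohen's weighted kappa for two raters over ordered categories.

    `categories` lists every possible grade in ascending order.
    `weighting` is "linear" or "quadratic"; which one the study used
    is not stated in the abstract (assumption).
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    if len(categories) < 2:
        raise ValueError("at least two categories are required")
    if weighting not in ("linear", "quadratic"):
        raise ValueError("weighting must be 'linear' or 'quadratic'")

    index = {c: i for i, c in enumerate(categories)}
    k = len(categories)
    n = len(rater_a)
    power = 1 if weighting == "linear" else 2

    # Observed joint counts and each rater's marginal counts.
    joint = Counter((index[a], index[b]) for a, b in zip(rater_a, rater_b))
    marg_a = Counter(index[a] for a in rater_a)
    marg_b = Counter(index[b] for b in rater_b)

    observed = 0.0  # weighted disagreement actually seen
    expected = 0.0  # weighted disagreement expected by chance
    for i in range(k):
        for j in range(k):
            w = (abs(i - j) / (k - 1)) ** power
            observed += w * joint[(i, j)] / n
            expected += w * marg_a[i] * marg_b[j] / (n * n)

    if expected == 0:
        return 1.0  # both raters used one identical category throughout
    return 1.0 - observed / expected


if __name__ == "__main__":
    # Hypothetical grade-group assignments (1-5) for eight slides.
    grades = [1, 2, 3, 4, 5]
    pathologist_x = [1, 2, 3, 4, 5, 1, 2, 3]
    pathologist_y = [1, 2, 3, 5, 5, 1, 3, 3]
    print(weighted_kappa(pathologist_x, pathologist_y, grades))
```

Weighted kappa penalizes a disagreement by how far apart the two grades are, so a one-step difference between adjacent grades lowers the score less than a difference of several grades; this is why it is commonly preferred over unweighted kappa for ordinal scales such as grade groups.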
Pages: 12