Automated Extraction of Grade, Stage, and Quality Information From Transurethral Resection of Bladder Tumor Pathology Reports Using Natural Language Processing

被引:35
作者
Glaser, Alexander P. [1 ,2 ]
Jordan, Brian J. [1 ,2 ]
Cohen, Jason [1 ]
Desai, Anuj [1 ]
Silberman, Philip [3 ]
Meeks, Joshua J. [1 ,2 ]
机构
[1] Northwestern Univ, Feinberg Sch Med, Chicago, IL 60611 USA
[2] Northwestern Univ, Robert H Lurie Comprehens Canc Ctr, Chicago, IL 60611 USA
[3] Northwestern Univ, Clin & Translat Sci Inst, Chicago, IL 60611 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1200/CCI.17.00128
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Purpose Bladder cancer is initially diagnosed and staged with a transurethral resection of bladder tumor (TURBT). Patient survival is dependent on appropriate sampling of layers of the bladder, but pathology reports are dictated as free text, making large-scale data extraction for quality improvement challenging. We sought to automate extraction of stage, grade, and quality information from TURBT pathology reports using natural language processing (NLP). Methods Patients undergoing TURBT were retrospectively identified using the Northwestern Enterprise Data Warehouse. An NLP algorithm was then created to extract information from free-text pathology reports and was iteratively improved using a training set of manually reviewed TURBTs. NLP accuracy was then validated using another set of manually reviewed TURBTs, and reliability was calculated using Cohen's kappa. Results Of 3,042 TURBTs identified from 2006 to 2016, 39% were classified as benign, 35% as Ta, 11% as T1, 4% as T2, and 10% as isolated carcinoma in situ. Of 500 randomly selected manually reviewed TURBTs, NLP correctly staged 88% of specimens (kappa = 0.82; 95% CI, 0.78 to 0.86). Of 272 manually reviewed T1 tumors, NLP correctly categorized grade in 100% of tumors (kappa = 1), correctly categorized if muscularis propria was reported by the pathologist in 98% of tumors (kappa = 0.81; 95% CI, 0.62 to 0.99), and correctly categorized if muscularis propria was present or absent in the resection specimen in 82% of tumors (kappa = 0.62; 95% CI, 0.55 to 0.73). Discrepancy analysis revealed pathologist notes and deeper resection specimens as frequent reasons for NLP misclassifications. Conclusion We developed an NLP algorithm that demonstrates a high degree of reliability in extracting stage, grade, and presence of muscularis propria from TURBT pathology reports. Future iterations can continue to improve performance, but automated extraction of oncologic information is promising in improving quality and assisting physicians in delivery of care. (C) 2018 by American Society of Clinical Oncology
引用
收藏
页码:1 / 8
页数:8
相关论文
共 20 条
[1]  
Amin MB, PROTOCOL EXAMINATION
[2]   Variability in the recurrence rate at first follow-up cystoscopy after TUR in stage Ta T1 transitional cell carcinoma of the bladder: A combined analysis of seven EORTC studies [J].
Brausi, M ;
Collette, L ;
Kurth, K ;
van der Meijden, AP ;
Oosterlinck, W ;
Witjes, JA ;
Newling, D ;
Bouffioux, C ;
Sylvester, RJ .
EUROPEAN UROLOGY, 2002, 41 (05) :523-530
[3]   Detrusor Muscle in TUR-Derived Bladder Tumor Specimens: Can We Actually Improve the Surgical Quality? [J].
Capogrosso, Paolo ;
Capitanio, Umberto ;
Ventimiglia, Eugenio ;
Boeri, Luca ;
Briganti, Alberto ;
Colombo, Renzo ;
Montorsi, Francesco ;
Salonia, Andrea .
JOURNAL OF ENDOUROLOGY, 2016, 30 (04) :400-405
[4]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[5]   Clinical Outcome in a Contemporary Series of Restaged Patients with Clinical T1 Bladder Cancer [J].
Dalbagni, Guido ;
Vora, Kinjal ;
Kaag, Matthew ;
Cronin, Angel ;
Bochner, Bernard ;
Donat, S. Machele ;
Herr, Harry W. .
EUROPEAN UROLOGY, 2009, 56 (06) :903-909
[6]   Impact of Routine Second Transurethral Resection on the Long-Term Outcome of Patients with Newly Diagnosed pT1 Urothelial Carcinoma with Respect to Recurrence, Progression Rate, and Disease-Specific Survival: A Prospective Randomised Clinical Trial [J].
Divrik, Rauf Taner ;
Sahin, Ali F. ;
Yildirim, Uemit ;
Altok, Muammer ;
Zorlu, Ferruh .
EUROPEAN UROLOGY, 2010, 58 (02) :185-190
[7]  
Gregg JR, 2017, JCO CLIN CANCER INFO, V1, DOI 10.1200/CCI.16.00045
[8]   Provider Treatment Intensity and Outcomes for Patients With Early-Stage Bladder Cancer [J].
Hollenbeck, Brent K. ;
Ye, Zaojun ;
Dunn, Rodney L. ;
Montie, James E. ;
Birkmeyer, John D. .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2009, 101 (08) :571-580
[9]   Understanding the Variation in Treatment Intensity Among Patients With Early Stage Bladder Cancer [J].
Hollingsworth, John M. ;
Zhang, Yun ;
Krein, Sarah L. ;
Ye, Zaojun ;
Hollenbeck, Brent K. .
CANCER, 2010, 116 (15) :3587-3594
[10]   MEASUREMENT OF OBSERVER AGREEMENT FOR CATEGORICAL DATA [J].
LANDIS, JR ;
KOCH, GG .
BIOMETRICS, 1977, 33 (01) :159-174