Application of Natural Language Processing in Electronic Health Record Data Extraction for Navigating Prostate Cancer Care: A Narrative Review

被引:4
作者
Bhatia, Ansh [1 ,2 ]
Titus, Renil [2 ]
Porto, Joao G. [1 ]
Katz, Jonathan [3 ]
Lopategui, Diana M. [1 ]
Marcovich, Robert [1 ]
Parekh, Dipen J. [1 ]
Shah, Hemendra N. [1 ]
机构
[1] Univ Miami, Desai Sethi Urol Inst, Miller Sch Med, Miami, FL USA
[2] Seth GS Med Coll & King Edward Mem Hosp, Mumbai, Maharashtra, India
[3] Univ Calif, Dept Urol, San Diego, CA USA
关键词
artificial intelligence; natural language processing; prostate cancer; cancer staging;
D O I
10.1089/end.2023.0690
中图分类号
R5 [内科学]; R69 [泌尿科学(泌尿生殖系疾病)];
学科分类号
1002 ; 100201 ;
摘要
Introduction: Natural language processing (NLP)-based data extraction from electronic health records (EHRs) holds significant potential to simplify clinical management and aid research. This review aims to evaluate the current landscape of NLP-based data extraction in prostate cancer (PCa) management. Materials and Methods: We conducted a literature search of PubMed and Google Scholar databases using the keywords: "Natural Language Processing," "Prostate Cancer," "data extraction," and "EHR" with variations of each. No language or time limits were imposed. All results were collected in a standardized manner, including country of origin, sample size, algorithm, objective of outcome, and model performance. The precision, recall, and the F1 score of studies were collected as a metric of model performance. Results: Of the 14 studies included in the review, 2 articles focused on documenting digital rectal examinations, 1 on identifying and quantifying pain secondary to PCa, 8 on extracting staging/grading information from clinical reports, with an emphasis on TNM-classification, risk stratification, and identifying metastasis, 2 articles focused on patient-centered post-treatment outcomes such as incontinence, erectile, and bowel dysfunction, and 1 on loneliness/social isolation following PCa diagnosis. All models showed moderate to high data annotation/extraction accuracy compared with the gold standard method of manual data extraction by chart review. Despite their potential, NLPs face challenges in handling ambiguous, institution-specific language and context nuances, leading to occasional inaccuracies in clinical data interpretation. Conclusion: NLP-based data extraction has effectively extracted various outcomes from PCa patients' EHRs. It holds the potential for automating outcome monitoring and data collection, resulting in time and labor savings.
引用
收藏
页码:852 / 864
页数:13
相关论文
共 44 条
[1]   Ascertainment of Veterans With Metastatic Prostate Cancer in Electronic Health Records: Demonstrating the Case for Natural Language Processing [J].
Alba, Patrick R. ;
Gao, Anthony ;
Lee, Kyung Min ;
Anglin-Foote, Tori ;
Robison, Brian ;
Katsoulakis, Evangelia ;
Rose, Brent S. ;
Efimova, Olga ;
Ferraro, Jeffrey P. ;
Patterson, Olga V. ;
Shelton, Jeremy B. ;
Duvall, Scott L. ;
Lynch, Julie A. .
JCO CLINICAL CANCER INFORMATICS, 2021, 5 :1005-1014
[2]   Natural language processing in low back pain and spine diseases: A systematic review [J].
Bacco, Luca ;
Russo, Fabrizio ;
Ambrosio, Luca ;
D'Antoni, Federico ;
Vollero, Luca ;
Vadala, Gianluca ;
Dell'Orletta, Felice ;
Merone, Mario ;
Papalia, Rocco ;
Denaro, Vincenzo .
FRONTIERS IN SURGERY, 2022, 9
[3]   Weakly supervised natural language processing for assessing patient-centered outcome following prostate cancer treatment [J].
Banerjee, Imon ;
Li, Kevin ;
Seneviratne, Martin ;
Ferrari, Michelle ;
Seto, Tina ;
Brooks, James D. ;
Rubin, Daniel L. ;
Hernandez-Boussard, Tina .
JAMIA OPEN, 2019, 2 (01) :150-159
[4]   Natural language processing with machine learning to predict outcomes after ovarian cancer surgery [J].
Barber, Emma L. ;
Garg, Ravi ;
Persenaire, Christianne ;
Simon, Melissa .
GYNECOLOGIC ONCOLOGY, 2021, 160 (01) :182-186
[5]  
Bohr A., 2020, Artif. intell. healthc, P25, DOI DOI 10.1016/B978-0-12-818438-7.00002-2
[6]   Expanding the Secondary Use of Prostate Cancer Real World Data: Automated Classifiers for Clinical and Pathological Stage [J].
Bozkurt, Selen ;
Magnani, Christopher J. J. ;
Seneviratne, Martin G. G. ;
Brooks, James D. D. ;
Hernandez-Boussard, Tina .
FRONTIERS IN DIGITAL HEALTH, 2022, 4
[7]   Is it possible to automatically assess pretreatment digital rectal examination documentation using natural language processing? A single-centre retrospective study [J].
Bozkurt, Selen ;
Kan, Kathleen M. ;
Ferrari, Michelle K. ;
Rubin, Daniel L. ;
Blayney, Douglas W. ;
Hernandez-Boussard, Tina ;
Brooks, James D. .
BMJ OPEN, 2019, 9 (07)
[8]  
Bozkurt Selen, 2018, AMIA Annu Symp Proc, V2018, P288
[9]   Development of a HIPAA-compliant environment for translational research data and analytics [J].
Bradford, Wayne ;
Hurdle, John F. ;
LaSalle, Bernie ;
Facelli, Julio C. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (01) :185-189
[10]   Healthcare Reimbursement and Quality Improvement: Integration Using the Electronic Medical Record Comment on "Fee-for-Service Payment - an Evil Practice That Must Be Stamped Out?" [J].
Britton, John R. .
INTERNATIONAL JOURNAL OF HEALTH POLICY AND MANAGEMENT, 2015, 4 (08) :549-551