Using Natural Language Processing to Improve Discrete Data Capture From Interpretive Cervical Biopsy Diagnoses at a Large Health Care Organization

被引:3
作者
Wi, Soora [1 ,3 ]
Goldhoff, Patricia E. [1 ]
Fuller, Laurie A. [1 ]
Grewal, Kiranjit [1 ]
Wentzensen, Nicolas [2 ]
Clarke, Megan A. [2 ]
Lorey, Thomas S. [1 ]
机构
[1] Kaiser Permanente, TPMG Reg Labs, Berkeley, CA USA
[2] NCI, Div Canc Epidemiol & Genet, Bethesda, MD USA
[3] Kaiser Permanente, TPMG Reg Lab, 1725 Eastshore Hwy, Berkeley, CA 94710 USA
关键词
TERMINOLOGY; PROJECT; LESIONS; TEXT;
D O I
10.5858/arpa.2021-0410-OA
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
Context.-The terminology used by pathologists to describe and grade dysplasia and premalignant changes of the cervical epithelium has evolved over time. Unfor-tunately, coexistence of different classification systems combined with nonstandardized interpretive text has created multiple layers of interpretive ambiguity.Objective.-To use natural language processing (NLP) to automate and expedite translation of interpretive text to a single most severe, and thus actionable, cervical intraep-ithelial neoplasia (CIN) diagnosis.Design.-We developed and applied NLP algorithms to 35 847 unstructured cervical pathology reports and assessed NLP performance in identifying the most severe diagnosis, compared to expert manual review. NLP performance was determined by calculating precision, recall, and F score. Results.-The NLP algorithms yielded a precision of 0.957, a recall of 0.925, and an F score of 0.94. Additionally, we estimated that the time to evaluate each monthly biopsy file was significantly reduced, from 30 hours to 0.5 hours.Conclusions.-A set of validated NLP algorithms applied to pathology reports can rapidly and efficiently assign a discrete, actionable diagnosis using CIN classification to assist with clinical management of cervical pathology and disease. Moreover, discrete diagnostic data encoded as CIN terminology can enhance the efficiency of clinical research.
引用
收藏
页码:222 / 226
页数:5
相关论文
共 18 条
[1]  
Chaudhry R, 2017, R21HS022911 AG HEALT
[2]  
College of American Pathologists, RES PUBL CANC PROT
[3]   The Lower Anogenital Squamous Terminology Standardization Project for HPV-associated Lesions: Background and Consensus Recommendations From the College of American Pathologists and the American Society for Colposcopy and Cervical Pathology [J].
Darragh, Teresa M. ;
Colgan, Terence J. ;
Cox, J. Thomas ;
Heller, Debra S. ;
Henry, Michael R. ;
Luff, Ronald D. ;
McCalmont, Timothy ;
Nayar, Ritu ;
Palefsky, Joel M. ;
Stoler, Mark H. ;
Wilkinson, Edward J. ;
Zaino, Richard J. ;
Wilbur, David C. .
INTERNATIONAL JOURNAL OF GYNECOLOGICAL PATHOLOGY, 2013, 32 (01) :76-115
[4]  
Elkin Peter L, 2008, AMIA Annu Symp Proc, P172
[5]   Extracting information from the text of electronic medical records to improve case detection: a systematic review [J].
Ford, Elizabeth ;
Carroll, John A. ;
Smith, Helen E. ;
Scott, Donia ;
Cassell, Jackie A. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (05) :1007-1015
[6]   Using Natural Language Processing to Extract Abnormal Results From Cancer Screening Reports [J].
Moore, Carlton R. ;
Farrag, Ashraf ;
Ashkin, Evan .
JOURNAL OF PATIENT SAFETY, 2017, 13 (03) :138-143
[7]   The Bethesda System for Reporting Cervical Cytology: A Historical Perspective [J].
Nayar, Ritu ;
Wilbur, David C. .
ACTA CYTOLOGICA, 2017, 61 (4-5) :359-372
[8]   The Lower Anogenital Squamous Terminology Project and Its Implications for Clinical Care [J].
Nuno, Tomas ;
Garcia, Francisco .
OBSTETRICS AND GYNECOLOGY CLINICS OF NORTH AMERICA, 2013, 40 (02) :225-+
[9]   Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review [J].
Sheikhalishahi, Seyedmostafa ;
Miotto, Riccardo ;
Dudley, Joel T. ;
Lavelli, Alberto ;
Rinaldi, Fabio ;
Osmani, Venet .
JMIR MEDICAL INFORMATICS, 2019, 7 (02) :15-32
[10]  
Si Yuqi, 2018, AMIA Annu Symp Proc, V2018, P1524