Using Natural Language Processing to Improve Discrete Data Capture From Interpretive Cervical Biopsy Diagnoses at a Large Health Care Organization

被引:3
作者
Wi, Soora [1 ,3 ]
Goldhoff, Patricia E. [1 ]
Fuller, Laurie A. [1 ]
Grewal, Kiranjit [1 ]
Wentzensen, Nicolas [2 ]
Clarke, Megan A. [2 ]
Lorey, Thomas S. [1 ]
机构
[1] Kaiser Permanente, TPMG Reg Labs, Berkeley, CA USA
[2] NCI, Div Canc Epidemiol & Genet, Bethesda, MD USA
[3] Kaiser Permanente, TPMG Reg Lab, 1725 Eastshore Hwy, Berkeley, CA 94710 USA
关键词
TERMINOLOGY; PROJECT; LESIONS; TEXT;
D O I
10.5858/arpa.2021-0410-OA
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
Context.-The terminology used by pathologists to describe and grade dysplasia and premalignant changes of the cervical epithelium has evolved over time. Unfor-tunately, coexistence of different classification systems combined with nonstandardized interpretive text has created multiple layers of interpretive ambiguity.Objective.-To use natural language processing (NLP) to automate and expedite translation of interpretive text to a single most severe, and thus actionable, cervical intraep-ithelial neoplasia (CIN) diagnosis.Design.-We developed and applied NLP algorithms to 35 847 unstructured cervical pathology reports and assessed NLP performance in identifying the most severe diagnosis, compared to expert manual review. NLP performance was determined by calculating precision, recall, and F score. Results.-The NLP algorithms yielded a precision of 0.957, a recall of 0.925, and an F score of 0.94. Additionally, we estimated that the time to evaluate each monthly biopsy file was significantly reduced, from 30 hours to 0.5 hours.Conclusions.-A set of validated NLP algorithms applied to pathology reports can rapidly and efficiently assign a discrete, actionable diagnosis using CIN classification to assist with clinical management of cervical pathology and disease. Moreover, discrete diagnostic data encoded as CIN terminology can enhance the efficiency of clinical research.
引用
收藏
页码:222 / 226
页数:5
相关论文
共 18 条
[11]   Large-scale identification of aortic stenosis and its severity using natural language processing on electronic health records [J].
Solomon, Matthew D. ;
Tabada, Grace ;
Allen, Amanda ;
Sung, Sue Hee ;
Go, Alan S. .
CARDIOVASCULAR DIGITAL HEALTH JOURNAL, 2021, 2 (03) :156-163
[12]   The Interpretive Variability of Cervical Biopsies and Its Relationship to HPV Status [J].
Stoler, Mark H. ;
Ronnett, Brigitte M. ;
Joste, Nancy E. ;
Hunt, William C. ;
Cuzick, Jack ;
Wheeler, Cosette M. .
AMERICAN JOURNAL OF SURGICAL PATHOLOGY, 2015, 39 (06) :729-736
[13]  
United States Preventive Services Task Force (USPSTF), Cervical Cancer: Screening
[14]   Using clinical Natural Language Processing for health outcomes research: Overview and actionable suggestions for future advances [J].
Velupillai, Sumithra ;
Suominen, Hanna ;
Liakata, Maria ;
Roberts, Angus ;
Shah, Anoop D. ;
Morley, Katherine ;
Osborn, David ;
Hayes, Joseph ;
Stewart, Robert ;
Downs, Johnny ;
Chapman, Wendy ;
Dutta, Rina .
JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 88 :11-19
[15]   Clinical decision support with automated text processing for cervical cancer screening [J].
Wagholikar, Kavishwar B. ;
MacLaughlin, Kathy L. ;
Henry, Michael R. ;
Greenes, Robert A. ;
Hankey, Ronald A. ;
Liu, Hongfang ;
Chaudhry, Rajeev .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, 19 (05) :833-839
[16]   Natural language processing for populating lung cancer clinical research data [J].
Wang, Liwei ;
Luo, Lei ;
Wang, Yanshan ;
Wampfler, Jason ;
Yang, Ping ;
Liu, Hongfang .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
[17]  
Waxman AG, 2012, OBSTET GYNECOL, V120, P1465, DOI [http://10.1097/AOG.0b013e31827001d5, 10.1097/AOG.0b013e31827001d5]
[18]   Using natural language processing and machine learning to identify breast cancer local recurrence [J].
Zeng, Zexian ;
Espino, Sasa ;
Roy, Ankita ;
Li, Xiaoyu ;
Khan, Seema A. ;
Clare, Susan E. ;
Jiang, Xia ;
Neapolitan, Richard ;
Luo, Yuan .
BMC BIOINFORMATICS, 2018, 19