A Methodological Approach to Validate Pneumonia Encounters from Radiology Reports Using Natural Language Processing

被引:4
|
作者
Panny, AlokSagar [1 ]
Hegde, Harshad [1 ]
Glurich, Ingrid [1 ]
Scannapieco, Frank A. [2 ]
Vedre, Jayanth G. [3 ]
VanWormer, Jeffrey J. [4 ]
Miecznikowski, Jeffrey [5 ]
Acharya, Amit [1 ,6 ]
机构
[1] Marshfield Clin Res Inst, Ctr Oral Syst Hlth, Marshfield, WI USA
[2] SUNY Buffalo, Sch Dent Med, Dept Oral Biol, Buffalo, NY USA
[3] Marshfield Clin Hlth Syst, Dept Crit Care Med, Marshfield, WI USA
[4] Marshfield Clin Res Inst, Ctr Clin Epidemiol & Populat Hlth, Marshfield, WI USA
[5] SUNY Buffalo, Sch Publ Hlth & Hlth Profess, Dept Biostat, Buffalo, NY USA
[6] Advocate Aurora Hlth, Advocate Aurora Res Inst, Downers Grove, IL 60515 USA
基金
美国国家卫生研究院;
关键词
pneumonia; natural language processing; knowledge bases;
D O I
10.1055/a-1817-7008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction Pneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented pneumonia-related features in radiology notes that are captured in electronic health records in an unstructured format. Objective The study objective was to develop a methodological approach for assessing validity of a pneumonia diagnosis based on identifying presence or absence of key radiographic features in radiology reports with subsequent rendering of diagnostic decisions into a structured format. Methods A pneumonia-specific natural language processing (NLP) pipeline was strategically developed applying Clinical Text Analysis and Knowledge Extraction System (cTAKES) to validate pneumonia diagnoses following development of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode into one of three categories: "positive," "negative," or "not classified: requires manual review" based on tagged concepts that support or refute diagnostic codes. Results A total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the total pneumonia episodes were documented by 225,893 chest X-ray reports. NLP classified and validated 33% (26,800/81,707) of pneumonia episodes classified as "Pneumonia-positive," 19% as (15401/81,707) as "Pneumonia-negative," and 48% (39,209/81,707) as "episode classification pending further manual review." NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%). Conclusion The pneumonia-specific NLP pipeline exhibited good performance comparable to other pneumonia-specific NLP systems developed to date.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 50 条
  • [1] Natural Language Processing to identify pneumonia from radiology reports
    Dublin, Sascha
    Baldwin, Eric
    Walker, Rod L.
    Christensen, Lee M.
    Haug, Peter J.
    Jackson, Michael L.
    Nelson, Jennifer C.
    Ferraro, Jeffrey
    Carrell, David
    Chapman, Wendy W.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2013, 22 (08) : 834 - 841
  • [2] Extracting information on pneumonia in infants using natural language processing of radiology reports
    Mendonça, EA
    Haas, J
    Shagina, L
    Larson, E
    Friedman, C
    JOURNAL OF BIOMEDICAL INFORMATICS, 2005, 38 (04) : 314 - 321
  • [3] Automatic Extraction of Major Osteoporotic Fractures from Radiology Reports using Natural Language Processing
    Wang, Yanshan
    Mehrabi, Saeed
    Sohn, Sunghwan
    Atkinson, Elizabeth
    Amin, Shreyasee
    Liu, Hongfang
    2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS WORKSHOPS (ICHI-W), 2018, : 64 - 65
  • [4] A systematic review of natural language processing applied to radiology reports
    Casey, Arlene
    Davidson, Emma
    Poon, Michael
    Dong, Hang
    Duma, Daniel
    Grivas, Andreas
    Grover, Claire
    Suarez-Paniagua, Victor
    Tobin, Richard
    Whiteley, William
    Wu, Honghan
    Alex, Beatrice
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [5] Natural language processing to identify ureteric stones in radiology reports
    Li, Andrew Yu
    Elliot, Nikki
    JOURNAL OF MEDICAL IMAGING AND RADIATION ONCOLOGY, 2019, 63 (03) : 307 - 310
  • [6] A systematic review of natural language processing applied to radiology reports
    Arlene Casey
    Emma Davidson
    Michael Poon
    Hang Dong
    Daniel Duma
    Andreas Grivas
    Claire Grover
    Víctor Suárez-Paniagua
    Richard Tobin
    William Whiteley
    Honghan Wu
    Beatrice Alex
    BMC Medical Informatics and Decision Making, 21
  • [7] Application of Natural Language Processing and Machine Learning to Radiology Reports
    Jeon, Seoungdeok
    Colburn, Zachary
    Sakai, Joshua
    Hung, Ling-Hong
    Yeung, Ka Yee
    12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [8] Between Always and Never: Evaluating Uncertainty in Radiology Reports Using Natural Language Processing
    Callen, Andrew L.
    Dupont, Sara M.
    Price, Adi
    Laguna, Ben
    McCoy, David
    Do, Bao
    Talbott, Jason
    Kohli, Marc
    Narvid, Jared
    JOURNAL OF DIGITAL IMAGING, 2020, 33 (05) : 1194 - 1201
  • [9] Between Always and Never: Evaluating Uncertainty in Radiology Reports Using Natural Language Processing
    Andrew L. Callen
    Sara M. Dupont
    Adi Price
    Ben Laguna
    David McCoy
    Bao Do
    Jason Talbott
    Marc Kohli
    Jared Narvid
    Journal of Digital Imaging, 2020, 33 : 1194 - 1201
  • [10] A Preliminary Study of Extracting Pulmonary Nodules and Nodule Characteristics from Radiology Reports Using Natural Language Processing
    Yang, Shuang
    Yang, Xi
    Lyu, Tianchen
    He, Xing
    Braithwaite, Dejana
    Mehta, Hiren J.
    Guo, Yi
    Wu, Yonghui
    Bian, Jiang
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, : 618 - 619