A Methodological Approach to Validate Pneumonia Encounters from Radiology Reports Using Natural Language Processing

被引:4
|
作者
Panny, AlokSagar [1 ]
Hegde, Harshad [1 ]
Glurich, Ingrid [1 ]
Scannapieco, Frank A. [2 ]
Vedre, Jayanth G. [3 ]
VanWormer, Jeffrey J. [4 ]
Miecznikowski, Jeffrey [5 ]
Acharya, Amit [1 ,6 ]
机构
[1] Marshfield Clin Res Inst, Ctr Oral Syst Hlth, Marshfield, WI USA
[2] SUNY Buffalo, Sch Dent Med, Dept Oral Biol, Buffalo, NY USA
[3] Marshfield Clin Hlth Syst, Dept Crit Care Med, Marshfield, WI USA
[4] Marshfield Clin Res Inst, Ctr Clin Epidemiol & Populat Hlth, Marshfield, WI USA
[5] SUNY Buffalo, Sch Publ Hlth & Hlth Profess, Dept Biostat, Buffalo, NY USA
[6] Advocate Aurora Hlth, Advocate Aurora Res Inst, Downers Grove, IL 60515 USA
基金
美国国家卫生研究院;
关键词
pneumonia; natural language processing; knowledge bases;
D O I
10.1055/a-1817-7008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction Pneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented pneumonia-related features in radiology notes that are captured in electronic health records in an unstructured format. Objective The study objective was to develop a methodological approach for assessing validity of a pneumonia diagnosis based on identifying presence or absence of key radiographic features in radiology reports with subsequent rendering of diagnostic decisions into a structured format. Methods A pneumonia-specific natural language processing (NLP) pipeline was strategically developed applying Clinical Text Analysis and Knowledge Extraction System (cTAKES) to validate pneumonia diagnoses following development of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode into one of three categories: "positive," "negative," or "not classified: requires manual review" based on tagged concepts that support or refute diagnostic codes. Results A total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the total pneumonia episodes were documented by 225,893 chest X-ray reports. NLP classified and validated 33% (26,800/81,707) of pneumonia episodes classified as "Pneumonia-positive," 19% as (15401/81,707) as "Pneumonia-negative," and 48% (39,209/81,707) as "episode classification pending further manual review." NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%). Conclusion The pneumonia-specific NLP pipeline exhibited good performance comparable to other pneumonia-specific NLP systems developed to date.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 50 条
  • [21] Automated Detection of Measurements and Their Descriptors in Radiology Reports Using a Hybrid Natural Language Processing Algorithm
    Selen Bozkurt
    Emel Alkim
    Imon Banerjee
    Daniel L. Rubin
    Journal of Digital Imaging, 2019, 32 : 544 - 553
  • [22] Automated labelling of radiology reports using natural language processing: Comparison of traditional and newer methods
    Chng, Seo Yi
    Tern, Paul J. W.
    Kan, Matthew R. X.
    Cheng, Lionel T. E.
    HEALTH CARE SCIENCE, 2023, 2 (02): : 120 - 128
  • [23] Natural Language Processing to Identify Pulmonary Nodules and Extract Nodule Characteristics From Radiology Reports
    Zheng, Chengyi
    Huang, Brian Z.
    Agazaryan, Andranik A.
    Creekmur, Beth
    Osuj, Thearis A.
    Gould, Michael K.
    CHEST, 2021, 160 (05) : 1902 - 1914
  • [24] Natural Language Processing of Radiology Reports in Patients With Hepatocellular Carcinoma to Predict Radiology Resource Utilization
    Brown, A. D.
    Kachura, J. R.
    JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2019, 16 (06) : 840 - 844
  • [25] Natural language processing and foreign language learning: Methodological approach from the performance of a linguistic task
    Canas, Alejandro Ramirez
    CUADERNO ACTIVA, 2022, (14): : 43 - 63
  • [26] Quality Management of Pulmonary Nodule Radiology Reports Based on Natural Language Processing
    Fei, Xiaolu
    Chen, Pengyu
    Wei, Lan
    Huang, Yue
    Xin, Yi
    Li, Jia
    BIOENGINEERING-BASEL, 2022, 9 (06):
  • [27] Machine learning based natural language processing of radiology reports in orthopaedic trauma
    Olthof, A. W.
    Shouche, P.
    Fennema, E. M.
    IJpma, F. F. A.
    Koolstra, R. H. C.
    Stirler, V. M. A.
    van Ooijen, P. M. A.
    Cornelissen, L. J.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 208
  • [28] Natural language processing to predict isocitrate dehydrogenase genotype in diffuse glioma using MR radiology reports
    Kim, Minjae
    Ong, Kai Tzu-iunn
    Choi, Seonah
    Yeo, Jinyoung
    Kim, Sooyon
    Han, Kyunghwa
    Park, Ji Eun
    Kim, Ho Sung
    Choi, Yoon Seong
    Ahn, Sung Soo
    Kim, Jinna
    Lee, Seung-Koo
    Sohn, Beomseok
    EUROPEAN RADIOLOGY, 2023, 33 (11) : 8017 - 8025
  • [29] Natural language processing to predict isocitrate dehydrogenase genotype in diffuse glioma using MR radiology reports
    Minjae Kim
    Kai Tzu-iunn Ong
    Seonah Choi
    Jinyoung Yeo
    Sooyon Kim
    Kyunghwa Han
    Ji Eun Park
    Ho Sung Kim
    Yoon Seong Choi
    Sung Soo Ahn
    Jinna Kim
    Seung-Koo Lee
    Beomseok Sohn
    European Radiology, 2023, 33 : 8017 - 8025
  • [30] The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
    Rixe, Nancy
    Frisch, Adam
    Wang, Zhendong
    Martin, Judith M.
    Suresh, Srinivasan
    Florin, Todd A.
    Ramgopal, Sriram
    FRONTIERS IN DIGITAL HEALTH, 2023, 5