A Methodological Approach to Validate Pneumonia Encounters from Radiology Reports Using Natural Language Processing

被引:4
作者
Panny, AlokSagar [1 ]
Hegde, Harshad [1 ]
Glurich, Ingrid [1 ]
Scannapieco, Frank A. [2 ]
Vedre, Jayanth G. [3 ]
VanWormer, Jeffrey J. [4 ]
Miecznikowski, Jeffrey [5 ]
Acharya, Amit [1 ,6 ]
机构
[1] Marshfield Clin Res Inst, Ctr Oral Syst Hlth, Marshfield, WI USA
[2] SUNY Buffalo, Sch Dent Med, Dept Oral Biol, Buffalo, NY USA
[3] Marshfield Clin Hlth Syst, Dept Crit Care Med, Marshfield, WI USA
[4] Marshfield Clin Res Inst, Ctr Clin Epidemiol & Populat Hlth, Marshfield, WI USA
[5] SUNY Buffalo, Sch Publ Hlth & Hlth Profess, Dept Biostat, Buffalo, NY USA
[6] Advocate Aurora Hlth, Advocate Aurora Res Inst, Downers Grove, IL 60515 USA
基金
美国国家卫生研究院;
关键词
pneumonia; natural language processing; knowledge bases;
D O I
10.1055/a-1817-7008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction Pneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented pneumonia-related features in radiology notes that are captured in electronic health records in an unstructured format. Objective The study objective was to develop a methodological approach for assessing validity of a pneumonia diagnosis based on identifying presence or absence of key radiographic features in radiology reports with subsequent rendering of diagnostic decisions into a structured format. Methods A pneumonia-specific natural language processing (NLP) pipeline was strategically developed applying Clinical Text Analysis and Knowledge Extraction System (cTAKES) to validate pneumonia diagnoses following development of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode into one of three categories: "positive," "negative," or "not classified: requires manual review" based on tagged concepts that support or refute diagnostic codes. Results A total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the total pneumonia episodes were documented by 225,893 chest X-ray reports. NLP classified and validated 33% (26,800/81,707) of pneumonia episodes classified as "Pneumonia-positive," 19% as (15401/81,707) as "Pneumonia-negative," and 48% (39,209/81,707) as "episode classification pending further manual review." NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%). Conclusion The pneumonia-specific NLP pipeline exhibited good performance comparable to other pneumonia-specific NLP systems developed to date.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 50 条
  • [41] Automating Stroke Data Extraction From Free-Text Radiology Reports Using Natural Language Processing: Instrument Validation Study
    Yu, Amy Y. X.
    Liu, Zhongyu A.
    Pou-Prom, Chloe
    Lopes, Kaitlyn
    Kapral, Moira K.
    Aviv, Richard, I
    Mamdani, Muhammad
    JMIR MEDICAL INFORMATICS, 2021, 9 (05)
  • [42] Automated Extraction of BI-RADS Final Assessment Categories from Radiology Reports with Natural Language Processing
    Sippo, Dorothy A.
    Warden, Graham I.
    Andriole, Katherine P.
    Lacson, Ronilda
    Ikuta, Ichiro
    Birdwell, Robyn L.
    Khorasani, Ramin
    JOURNAL OF DIGITAL IMAGING, 2013, 26 (05) : 989 - 994
  • [43] Natural language processing in radiology: Clinical applications and future directions
    Bobba, Pratheek S.
    Sailer, Anne
    Pruneski, James A.
    Beck, Spencer
    Mozayan, Ali
    Mozayan, Sara
    Arango, Jennifer
    Cohan, Arman
    Chheang, Sophie
    CLINICAL IMAGING, 2023, 97 : 55 - 61
  • [44] Natural Language Processing in Radiology: A Systematic Review
    Pons, Ewoud
    Braun, Loes M. M.
    Hunink, M. G. Myriam
    Kors, Jan A.
    RADIOLOGY, 2016, 279 (02) : 329 - 343
  • [45] Using Natural Language Processing of Free-Text Radiology Reports to Identify Type 1 Modic Endplate Changes
    Hannu T. Huhdanpaa
    W. Katherine Tan
    Sean D. Rundell
    Pradeep Suri
    Falgun H. Chokshi
    Bryan A. Comstock
    Patrick J. Heagerty
    Kathryn T. James
    Andrew L. Avins
    Srdjan S. Nedeljkovic
    David R. Nerenz
    David F. Kallmes
    Patrick H. Luetmer
    Karen J. Sherman
    Nancy L. Organ
    Brent Griffith
    Curtis P. Langlotz
    David Carrell
    Saeed Hassanpour
    Jeffrey G. Jarvik
    Journal of Digital Imaging, 2018, 31 : 84 - 90
  • [46] Using Natural Language Processing of Free-Text Radiology Reports to Identify Type 1 Modic Endplate Changes
    Huhdanpaa, Hannu T.
    Tan, W. Katherine
    Rundell, Sean D.
    Suri, Pradeep
    Chokshi, Falgun H.
    Comstock, Bryan A.
    Heagerty, Patrick J.
    James, Kathryn T.
    Avins, Andrew L.
    Nedeljkovic, Srdjan S.
    Nerenz, David R.
    Kallmes, David F.
    Luetmer, Patrick H.
    Sherman, Karen J.
    Organ, Nancy L.
    Griffith, Brent
    Langlotz, Curtis P.
    Carrell, David
    Hassanpour, Saeed
    Jarvik, Jeffrey G.
    JOURNAL OF DIGITAL IMAGING, 2018, 31 (01) : 84 - 90
  • [47] Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports
    Po-Hao Chen
    Hanna Zafar
    Maya Galperin-Aizenberg
    Tessa Cook
    Journal of Digital Imaging, 2018, 31 : 178 - 184
  • [48] Natural language processing of radiology reports for the detection of thromboembolic diseases and clinically relevant incidental findings
    Pham, Anne-Dominique
    Neveol, Aurelie
    Lavergne, Thomas
    Yasunaga, Daisuke
    Clement, Olivier
    Meyer, Guy
    Morello, Remy
    Burgun, Anita
    BMC BIOINFORMATICS, 2014, 15
  • [49] Natural language processing of radiology reports for the detection of thromboembolic diseases and clinically relevant incidental findings
    Anne-Dominique Pham
    Aurélie Névéol
    Thomas Lavergne
    Daisuke Yasunaga
    Olivier Clément
    Guy Meyer
    Rémy Morello
    Anita Burgun
    BMC Bioinformatics, 15
  • [50] Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports
    Chen, Po-Hao
    Zafar, Hanna
    Galperin-Aizenberg, Maya
    Cook, Tessa
    JOURNAL OF DIGITAL IMAGING, 2018, 31 (02) : 178 - 184