Evaluating the accuracy of lung-RADS score extraction from radiology reports: Manual entry versus natural language processing

被引:1
|
作者
Gandomi, Amir [1 ,2 ,7 ]
Hasan, Eusha [1 ,3 ]
Chusid, Jesse [1 ,3 ,4 ]
Paul, Subroto [1 ,3 ,5 ]
Inra, Matthew [1 ,3 ,5 ]
Makhnevich, Alex [1 ,2 ,3 ,4 ]
Raoof, Suhail [3 ,4 ,5 ]
Silvestri, Gerard [6 ]
Bade, Brett C. [1 ,2 ,3 ,5 ]
Cohen, Stuart L. [1 ,2 ,3 ,4 ]
机构
[1] Northwell, New Hyde Pk, NY USA
[2] Inst Hlth Syst Sci, Feinstein Inst Med Res, Manhasset, NY USA
[3] Donald & Barbara Zucker Sch Med Hofstra Northwell, Hempstead, NY USA
[4] North Shore Univ Hosp, Northwell, Manhasset, NY USA
[5] Lenox Hill Hosp, Northwell, New York, NY USA
[6] Med Univ South Carolina, Charleston, SC USA
[7] Hofstra Univ, Frank G Zarb Sch Business, Hempstead, NY USA
关键词
LC screening; Lung-RADS score; Follow-up; Manual entry; Natural language processing;
D O I
10.1016/j.ijmedinf.2024.105580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction: Radiology scoring systems are critical to the success of lung cancer screening (LCS) programs, impacting patient care, adherence to follow-up, data management and reporting, and program evaluation. Lung CT Screening Reporting and Data System (Lung-RADS) is a structured radiology scoring system that provides recommendations for LCS follow-up that are utilized (a) in clinical care and (b) by LCS programs monitoring rates of adherence to follow-up. Thus, accurate reporting and reliable collection of Lung-RADS scores are fundamental components of LCS program evaluation and improvement. Unfortunately, due to variability in radiology reports, extraction of Lung-RADS scores is non-trivial, and best practices do not exist. The purpose of this project is to compare mechanisms to extract Lung-RADS scores from free-text radiology reports. Methods: We retrospectively analyzed reports of LCS low-dose computed tomography (LDCT) examinations performed at a multihospital integrated healthcare network in New York State between January 2016 and July 2023. We compared three methods of Lung-RADS score extraction: manual physician entry at time of report creation, manual LCS specialist entry after report creation, and an internally developed, rule-based natural language processing (NLP) algorithm. Accuracy, recall, precision, and completeness (i.e., the proportion of LCS exams to which a Lung-RADS score has been assigned) were compared between the three methods. Results: The dataset includes 24,060 LCS examinations on 14,243 unique patients. The mean patient age was 65 years, and most patients were male (54 %) and white (75 %). Completeness rate was 65 %, 68 %, and 99 % for radiologists' manual entry, LCS specialists' entry, and NLP algorithm, respectively. Accuracy, recall, and precision were high across all extraction methods (>94 %), though the NLP-based approach was consistently higher than both manual entries in all metrics. Discussion: An NLP-based method of LCS score determination is an efficient and more accurate means of extracting Lung-RADS scores than manual review and data entry. NLP-based methods should be considered best practice for extracting structured Lung-RADS scores from free-text radiology reports.
引用
收藏
页数:7
相关论文
共 30 条
  • [21] A validated natural language processing algorithm for brain imaging phenotypes from radiology reports in UK electronic health records
    Wheater, Emily
    Mair, Grant
    Sudlow, Cathie
    Alex, Beatrice
    Grover, Claire
    Whiteley, William
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
  • [22] Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach
    Raza, Shaina
    Schwartz, Brian
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [23] Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach
    Shaina Raza
    Brian Schwartz
    BMC Medical Informatics and Decision Making, 23
  • [24] Automated extraction of information of lung cancer staging from unstructured reports of PET-CT interpretation: natural language processing with deep-learning
    Park, Hyung Jun
    Park, Namu
    Lee, Jang Ho
    Choi, Myeong Geun
    Ryu, Jin-Sook
    Song, Min
    Choi, Chang-Min
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [25] Automated extraction of information of lung cancer staging from unstructured reports of PET-CT interpretation: natural language processing with deep-learning
    Hyung Jun Park
    Namu Park
    Jang Ho Lee
    Myeong Geun Choi
    Jin-Sook Ryu
    Min Song
    Chang-Min Choi
    BMC Medical Informatics and Decision Making, 22
  • [26] Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing
    Liu Yi
    Zhu Li-Na
    Liu Qing
    Han Chao
    Zhang Xiao-Dong
    Wang Xiao-Ying
    中华医学杂志英文版, 2019, 132 (14) : 1673 - 1680
  • [27] Natural language processing in urology: Automated extraction of clinical information from histopathology reports of uro-oncology procedures
    Huang, Honghong
    Lim, Fiona Xin Yi
    Gu, Gary Tianyu
    Han, Matthew Jiangchou
    Fang, Andrew Hao Sen
    Chia, Elian Hui San
    Bei, Eileen Yen Tze
    Tham, Sarah Zhuling
    Ho, Henry Sun Sien
    Yuen, John Shyi Peng
    Sun, Aixin
    Lim, Jay Kheng Sit
    HELIYON, 2023, 9 (04)
  • [28] Automatic extraction of imaging observation and assessment categories from breast magnetic resonance imaging reports with natural language processing
    Liu, Yi
    Zhu, Li-Na
    Liu, Qing
    Han, Chao
    Zhang, Xiao-Dong
    Wang, Xiao-Ying
    CHINESE MEDICAL JOURNAL, 2019, 132 (14) : 1673 - 1680
  • [29] Cross-lingual Natural Language Processing on Limited Annotated Case/Radiology Reports in English and Japanese: Insights from the Real-MedNLP Workshop
    Yada, Shuntaro
    Nakamura, Yuta
    Wakamiya, Shoko
    Aramaki, Eiji
    METHODS OF INFORMATION IN MEDICINE, 2024,
  • [30] How Natural Language Processing Can Aid With Pulmonary Oncology Tumor Node Metastasis Staging From Free-Text Radiology Reports: Algorithm Development and Validation
    Puts, Sander
    Nobel, Martijn
    Zegers, Catharina
    Bermejo, Inigo
    Robben, Simon
    Dekker, Andre
    JMIR FORMATIVE RESEARCH, 2023, 7