Natural language processing of head CT reports to identify intracranial mass effect: CTIME algorithm

被引:5
作者
Gordon, Alexandra June [1 ]
Banerjee, Imon [2 ]
Block, Jason [3 ]
Winstead-Derlega, Christopher [4 ]
Wilson, Jennifer G. [1 ]
Mitarai, Tsuyoshi [1 ]
Jarrett, Michael [5 ]
Sanyal, Josh [6 ]
Rubin, Daniel L. [6 ]
Wintermark, Max [7 ]
Kohn, Michael A. [8 ]
机构
[1] Stanford Univ, Dept Emergency Med Crit Care, Stanford, CA USA
[2] Emory Univ, Dept Biomed Informat, Dept Radiol, Georgia Tech,Dept Biomed Engn, Atlanta, GA 30322 USA
[3] Stanford Univ, Dept Anesthesia, Div Crit Care, Stanford, CA USA
[4] Duke Univ, Div Emergency Med, Durham, NC USA
[5] QuesGen Syst Inc, Burlingame, CA USA
[6] Stanford Univ, Dept Biomed Data Sci Radiol & Med, Stanford, CA USA
[7] Stanford Univ, Dept Radiol, Neuroradiol Div, Stanford, CA USA
[8] Stanford Univ, Dept Emergency Med UCSF, Dept Epidemiol & Biostat, Stanford, CA USA
关键词
Emergency critical care; Natural language processing; Hospital mortality; MORTALITY; CARE;
D O I
10.1016/j.ajem.2021.11.001
中图分类号
R4 [临床医学];
学科分类号
1002 ; 100602 ;
摘要
Background: The Mortality Probability Model (MPM) is used in research and quality improvement to adjust for severity of illness and can also inform triage decisions. However, a limitation for its automated use or application is that it includes the variable "intracranial mass effect" (IME), which requires human engagement with the electronic health record (EHR). We developed and tested a natural language processing (NLP) algorithm to identify IME from CT head reports. Methods: We obtained initial CT head reports from adult patients who were admitted to the ICU from our ED between 10/2013 and 9/2016. Each head CT head report was labeled yes/no IME by at least two of five independent labelers. The reports were then randomly divided 80/20 into training and test sets. All reports were preprocessed to remove linguistic and style variability, and a dictionary was created to map similar common terms. We tested three vectorization strategies: Term Frequency-Inverse Document frequency (TF-IDF), Word2Vec, and Universal Sentence Encoder to convert the report text to a numerical vector. This vector served as the input to a classification-tree-based ensemble machine learning algorithm (XGBoost). After training, model performance was assessed in the test set using the area under the receiver operating characteristic curve (AUROC). We also divided the continuous range of scores into positive/inconclusive/negative categories for IME. Results: Of the 1202 CT reports in the training set, 308 (25.6%) reports were manually labeled as "yes" for IME. Of the 355 reports in the test set, 108 (30.4%) were labeled as "yes" for IME. The TF-IDF vectorization strategy as an input for the XGBoost model had the best AUROC:- 0.9625 (95% CI 0.9443-0.9807). TF-IDF score categories were defined and had the following likelihood ratios: "positive" (TF-IDF score > 0.5) LR = 24.59; "inconclusive" (TFIDF 0.05-0.5) LR = 0.99; and "negative" (TF-IDF < 0.05) LR = 0.05.82% of reports were classified as either "positive" or "negative". In the test set, only 4 of 199 (2.0%) reports with a "negative" classification were false negatives and only 8 of 93 (8.6%) reports classified as "positive" were false positives. Conclusion: NLP can accurately identify IME from free-text reports of head CTs in approximately 80% of records, adequate to allow automatic calculation of MPM based on EHR data for many applications. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:388 / 392
页数:5
相关论文
共 17 条
[1]   Radiology report annotation using intelligent word embeddings: Applied to multi-institutional chest CT cohort [J].
Banerjee, Imon ;
Chen, Matthew C. ;
Lungren, Matthew P. ;
Rubin, Daniel L. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 77 :11-20
[2]  
Cer D, ARXIV2018180311175
[3]   Deep Learning to Classify Radiology Free-Text Reports [J].
Chen, Matthew C. ;
Ball, Robyn L. ;
Yang, Lingyao ;
Moradzadeh, Nathaniel ;
Chapman, Brian E. ;
Larson, David B. ;
Langlotz, Curtis P. ;
Amrhein, Timothy J. ;
Lungren, Matthew P. .
RADIOLOGY, 2018, 286 (03) :845-852
[4]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[5]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[6]  
Fau Knaus Wa, APACHE ACUTE PHYSL C, V2021, P0090
[7]   Chart reviews in emergency medicine research: Where are the methods? [J].
Gilbert, EH ;
Lowenstein, SR ;
KoziolMcLain, J ;
Barta, DC ;
Steiner, J .
ANNALS OF EMERGENCY MEDICINE, 1996, 27 (03) :305-308
[8]   Outcome prediction in critical care: the Mortality Probability Models [J].
Higgins, Thomas L. ;
Teres, Daniel ;
Nathanson, Brian .
CURRENT OPINION IN CRITICAL CARE, 2008, 14 (05) :498-505
[9]  
Le Gall Jr Fau, SIMPLIFIED ACUTE PHY, V2021, P0090
[10]   MORTALITY PROBABILITY-MODELS (MPM-II) BASED ON AN INTERNATIONAL COHORT OF INTENSIVE-CARE UNIT PATIENTS [J].
LEMESHOW, S ;
TERES, D ;
KLAR, J ;
AVRUNIN, JS ;
GEHLBACH, SH ;
RAPOPORT, J .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1993, 270 (20) :2478-2486