Text mining approach to predict hospital admissions using early medical records from the emergency department

Cited by: 80
Authors
Lucini, Filipe R. [1]
Fogliatto, Flavio S. [1]
da Silveira, Giovani J. C. [2]
Neyeloff, Jeruza L. [3]
Anzanello, Michel J. [1]
Kuchenbecker, Ricardo de S. [3]
Schaan, Beatriz D. [3]
Affiliations
[1] Univ Fed Rio Grande do Sul, Ind Engn Dept, Ave Osvaldo Aranha,99,5 Andar, BR-90035190 Porto Alegre, RS, Brazil
[2] Univ Calgary, Haskayne Sch Business, 2500 Univ Dr NW, Calgary, AB T2N 1N4, Canada
[3] Univ Fed Rio Grande do Sul, Hosp Clin Porto Alegre, Rua Ramiro Barcelos,2350, BR-90035903 Porto Alegre, RS, Brazil
Keywords
Text mining; Emergency departments; Clinical decision support; CLASSIFICATION; ADABOOST; IDENTIFICATION; INFORMATION; TRANSFORM; SYSTEM
DOI
10.1016/j.ijmedinf.2017.01.001
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: Emergency department (ED) overcrowding is a serious issue for hospitals. Early information on short-term inward bed demand from patients receiving care at the ED may reduce overcrowding and optimize the use of hospital resources. In this study, we use text mining methods to process early ED patient records structured under the SOAP framework and to predict future hospitalizations and discharges.
Design: We test different approaches for pre-processing text records and for predicting hospitalization. Sets-of-words are obtained via binary representation, term frequency, and term frequency-inverse document frequency. Unigrams, bigrams, and trigrams are tested for feature formation. Feature selection is based on χ² and F-score metrics. In the prediction module, eight text mining methods are tested: Decision Tree, Random Forest, Extremely Randomized Tree, AdaBoost, Logistic Regression, Multinomial Naive Bayes, Support Vector Machine (linear kernel), and Nu-Support Vector Machine (linear kernel).
Measurements: Prediction performance is evaluated by F1-scores. Precision and recall values are also reported for all text mining methods tested.
Results: Nu-Support Vector Machine was the text mining method with the best overall performance. Its average F1-score in predicting hospitalization was 77.70%, with a standard deviation (SD) of 0.66%.
Conclusions: The method could be used to manage daily routines in EDs such as capacity planning and resource allocation. Text mining could provide valuable information and facilitate decision-making by inward bed management teams.
© 2017 Elsevier Ireland Ltd. All rights reserved.
Pages: 1-8
Number of pages: 8
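The three text representations named in the abstract (binary, term frequency, and term frequency-inverse document frequency over unigrams up to trigrams) and the reported evaluation metrics (precision, recall, F1) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the whitespace tokenizer, the unsmoothed idf formula, and the toy notes are all assumptions made for illustration, and the feature-selection and classifier stages are omitted.

```python
import math
from collections import Counter


def ngrams(text, n_max=3):
    """Return all word n-grams (n = 1..n_max) of a lowercased text.

    Tokenization by whitespace is an illustrative assumption.
    """
    tokens = text.lower().split()
    grams = []
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            grams.append(" ".join(tokens[i:i + n]))
    return grams


def vectorize(docs, scheme="tfidf", n_max=1):
    """Map each document to a {feature: weight} dict under one scheme."""
    counts = [Counter(ngrams(d, n_max)) for d in docs]
    n_docs = len(docs)
    # Document frequency: number of documents containing each feature.
    doc_freq = Counter(term for c in counts for term in c)
    vectors = []
    for c in counts:
        if scheme == "binary":
            vec = {t: 1.0 for t in c}
        elif scheme == "tf":
            vec = {t: float(k) for t, k in c.items()}
        elif scheme == "tfidf":
            # Plain idf = log(N / df); libraries often use smoothed variants.
            vec = {t: k * math.log(n_docs / doc_freq[t]) for t, k in c.items()}
        else:
            raise ValueError(f"unknown scheme: {scheme}")
        vectors.append(vec)
    return vectors


def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall, and F1 for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Made-up toy notes, not data from the study.
notes = ["chest pain chest tightness", "mild cough", "chest pain fever"]
binary = vectorize(notes, "binary")
tf = vectorize(notes, "tf")
tfidf = vectorize(notes, "tfidf")
```

In this sketch a feature that occurs in every document receives a tf-idf weight of zero, which is one reason library implementations usually apply a smoothed idf instead.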