Unsupervised Machine Learning of Topics Documented by Nurses about Hospitalized Patients Prior to a Rapid-Response Event

被引:15
作者
Korach, Zfania Tom [1 ]
Cato, Kenrick D. [2 ]
Collins, Sarah A. [2 ,3 ]
Kang, Min Jeoung [1 ]
Knaplund, Christopher [2 ]
Dykes, Patricia C. [1 ]
Wang, Liqin [1 ]
Schnock, Kumiko O. [1 ]
Garcia, Jose P. [1 ]
Jia, Haomiao [2 ]
Chang, Frank [1 ]
Schwartz, Jessica M. [2 ]
Zhou, Li [1 ]
机构
[1] Brigham & Womens Hosp, Dept Med, Div Gen Internal Med & Primary Care, 75 Francis St, Boston, MA 02115 USA
[2] Columbia Univ, Sch Nursing, New York, NY USA
[3] Columbia Univ, Dept Biomed Informat, New York, NY USA
来源
APPLIED CLINICAL INFORMATICS | 2019年 / 10卷 / 05期
关键词
hospital rapid response team; nursing assessment; electronic health records; survival analysis; nursing notes; natural language processing; machine learning; medicine; MEDICAL EMERGENCY TEAM; ACTIVATION; MORTALITY; REASONS; ARREST; TRIAL;
D O I
10.1055/s-0039-3401814
中图分类号
R-058 [];
学科分类号
摘要
Background In the hospital setting, it is crucial to identify patients at risk for deterioration before it fully develops, so providers can respond rapidly to reverse the deterioration. Rapid response (RR) activation criteria include a subjective component ("worried about the patient") that is often documented in nurses' notes and is hard to capture and quantify, hindering active screening for deteriorating patients. Objectives We used unsupervised machine learning to automatically discover RR event risk/protective factors from unstructured nursing notes. Methods In this retrospective cohort study, we obtained nursing notes of hospitalized, nonintensive care unit patients, documented from 2015 through 2018 from Partners HealthCare databases. We applied topic modeling to those notes to reveal topics (clusters of associated words) documented by nurses. Two nursing experts named each topic with a representative Systematized Nomenclature of Medicine-Clinical Terms (SNOMED CT) concept. We used the concepts along with vital signs and demographics in a time-dependent covariates extended Cox model to identify risk/protective factors for RR event risk. Results From a total of 776,849 notes of 45,299 patients, we generated 95 stable topics, of which 80 were mapped to 72 distinct SNOMED CT concepts. Compared with a model containing only demographics and vital signs, the latent topics improved the model's predictive ability from a concordance index of 0.657 to 0.720. Thirty topics were found significantly associated with RR event risk at a 0.05 level, and 11 remained significant after Bonferroni correction of the significance level to 6.94E-04, including physical examination (hazard ratio [HR] = 1.07, 95% confidence interval [CI], 1.03-1.12), informing doctor (HR = 1.05, 95% CI, 1.03-1.08), and seizure precautions (HR = 1.08, 95% CI, 1.04-1.12). Conclusion Unsupervised machine learning methods can automatically reveal interpretable and informative signals from free-text and may support early identification of patients at risk for RR events.
引用
收藏
页码:952 / 963
页数:12
相关论文
共 18 条
  • [1] Andersen P K, 1992, Stat Methods Med Res, V1, P297, DOI 10.1177/096228029200100305
  • [2] [Anonymous], 2014, The Stanford CoreNLP Natural Language Processing Toolkit, DOI [10.3115/v1/p14-5010, DOI 10.3115/V1/P14-5010]
  • [3] [Anonymous], AMIA ANN S
  • [4] Prospective controlled trial of effect of medical emergency team on postoperative morbidity and mortality rates
    Bellomo, R
    Goldsmith, D
    Uchino, S
    Buckmaster, J
    Hart, G
    Opdam, H
    Silvester, W
    Doolan, L
    Gutteridge, G
    [J]. CRITICAL CARE MEDICINE, 2004, 32 (04) : 916 - 921
  • [5] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [6] RELATIONSHIP BETWEEN NURSING DOCUMENTATION AND PATIENTS' MORTALITY
    Collins, Sarah A.
    Cato, Kenrick
    Albers, David
    Scott, Karen
    Stetson, Peter D.
    Bakken, Suzanne
    Vawdrey, David K.
    [J]. AMERICAN JOURNAL OF CRITICAL CARE, 2013, 22 (04) : 306 - 313
  • [7] Importance of events per independent variable in proportional hazards analysis .1. Background, goals, and general strategy
    Concato, J
    Peduzzi, P
    Holford, TR
    Feinstein, AR
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 1995, 48 (12) : 1495 - 1501
  • [8] Hillman K, 2005, LANCET, V365, P2091
  • [9] Incidence, location and reasons for avoidable in-hospital cardiac arrest in a district general hospital
    Hodgetts, TJ
    Kenward, G
    Vlackonikolis, L
    Payne, S
    Castle, N
    Crouch, R
    Ineson, N
    Shaikh, L
    [J]. RESUSCITATION, 2002, 54 (02) : 115 - 123
  • [10] Activation of a Medical Emergency Team Using an Electronic Medical Recording-Based Screening System
    Huh, Jin Won
    Lim, Chae-Man
    Koh, Younsuck
    Lee, Jury
    Jung, Youn-Kyung
    Seo, Hyun-Suk
    Hong, Sang-Bum
    [J]. CRITICAL CARE MEDICINE, 2014, 42 (04) : 801 - 808