Unfolding Physiological State: Mortality Modelling in Intensive Care Units

被引:116
作者
Ghassemi, Marzyeh [1 ]
Naumann, Tristan [1 ]
Doshi-Velez, Finale [2 ]
Brimmer, Nicole [1 ]
Joshi, Rohit [1 ]
Rumshisky, Anna [3 ]
Szolovits, Peter [1 ]
机构
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Harvard, Boston, MA 02115 USA
[3] Univ Massachusetts Lowell, Lowell, MA 01854 USA
来源
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14) | 2014年
关键词
Primary: Data mining for social good; Secondary: Healthcare and medicine; Topic; graphical and latent variable models; Text; Support vector machines; SCORE;
D O I
10.1145/2623330.2623742
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate knowledge of a patient's disease state and trajectory is critical in a clinical setting. Modern electronic healthcare records contain an increasingly large amount of data, and the ability to automatically identify the factors that influence patient outcomes stand to greatly improve the efficiency and quality of care. We examined the use of latent variable models (viz. Latent Dirichlet Allocation) to decompose free-text hospital notes into meaningful features, and the predictive power of these features for patient mortality. We considered three prediction regimes: (1) baseline prediction, (2) dynamic (time-varying) outcome prediction, and (3) retrospective outcome prediction. In each, our prediction task differs from the familiar time-varying situation whereby data accumulates; since fewer patients have long ICU stays, as we move forward in time fewer patients are available and the prediction task becomes increasingly difficult. We found that latent topic-derived features were effective in determining patient mortality under three timelines: inhospital, 30 day post-discharge, and 1 year post-discharge mortality. Our results demonstrated that the latent topic features important in predicting hospital mortality are very different from those that are important in post-discharge mortality. In general, latent topic features were more predictive than structured features, and a combination of the two performed best. The time-varying models that combined latent topic features and baseline features had AUCs that reached 0.85, 0.80, and 0.77 for in-hospital, 30 day post-discharge and 1 year post-discharge mortality respectively. Our results agreed with other work suggesting that the first 24 hours of patient information are often the most predictive of hospital mortality. Retrospective models that used a combination of latent topic features and structured features achieved AUCs of 0.96, 0.82, and 0.81 for in-hospital, 30 day, and 1-year mortality prediction. Our work focuses on the dynamic (time-varying) setting because models from this regime could facilitate an on-going severity stratification system that helps direct care-staff resources and inform treatment strategies.
引用
收藏
页码:75 / 84
页数:10
相关论文
共 17 条
  • [1] [Anonymous], 2011, ACM T INTEL SYST TEC, DOI DOI 10.1145/1961189.1961199
  • [2] Arnold Corey W, 2010, AMIA Annu Symp Proc, V2010, P26
  • [3] Blei D.M., 2006, P 23 INT C MACHINE L, P113, DOI [DOI 10.1145/1143844.1143859, 10.1145/1143844.114385]
  • [4] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [5] Ghassemi M., 2012, P ICML 2012 MACH LEA
  • [6] Finding scientific topics
    Griffiths, TL
    Steyvers, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 : 5228 - 5235
  • [7] Hug Caleb W, 2009, AMIA Annu Symp Proc, V2009, P260
  • [8] A New Severity of Illness Scale Using a Subset of Acute Physiology and Chronic Health Evaluation Data Elements Shows Comparable Predictive Accuracy
    Johnson, Alistair E. W.
    Kramer, Andrew A.
    Clifford, Gari D.
    [J]. CRITICAL CARE MEDICINE, 2013, 41 (07) : 1711 - 1718
  • [9] THE APACHE-III PROGNOSTIC SYSTEM - RISK PREDICTION OF HOSPITAL MORTALITY FOR CRITICALLY ILL HOSPITALIZED ADULTS
    KNAUS, WA
    WAGNER, DP
    DRAPER, EA
    ZIMMERMAN, JE
    BERGNER, M
    BASTOS, PG
    SIRIO, CA
    MURPHY, DJ
    LOTRING, T
    DAMIANO, A
    HARRELL, FE
    [J]. CHEST, 1991, 100 (06) : 1619 - 1636
  • [10] A NEW SIMPLIFIED ACUTE PHYSIOLOGY SCORE (SAPS-II) BASED ON A EUROPEAN NORTH-AMERICAN MULTICENTER STUDY
    LEGALL, JR
    LEMESHOW, S
    SAULNIER, F
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1993, 270 (24): : 2957 - 2963