Predicting Complications in Critical Care Using Heterogeneous Clinical Data

被引:28
作者
Huddar, Vijay [1 ]
Desiraju, Bapu Koundinya [2 ]
Rajan, Vaibhav [1 ]
Bhattacharya, Sakyajit [1 ]
Roy, Shourya [1 ]
Reddy, Chandan K. [3 ]
机构
[1] Xerox Res Ctr India, Bangalore 560103, Karnataka, India
[2] Inst Genom & Integrat Biol, New Delhi 110025, India
[3] Virginia Tech, Dept Comp Sci, Arlington, VA 22203 USA
基金
美国国家科学基金会;
关键词
Clinical notes; topic models; heterogeneous data; multi-view learning; collective matrix factorization; postoperative respiratory failure; POSTOPERATIVE RESPIRATORY-FAILURE; AUTOMATED IDENTIFICATION; PULMONARY COMPLICATIONS; RISK; SURGERY; VALIDATION; INTUBATION; EVENTS; SCORE;
D O I
10.1109/ACCESS.2016.2618775
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Patients in hospitals, particularly in critical care, are susceptible to many complications affecting morbidity and mortality. Digitized clinical data in electronic medical records can be effectively used to develop machine learning models to identify patients at risk of complications early and provide prioritized care to prevent complications. However, clinical data from heterogeneous sources within hospitals pose significant modeling challenges. In particular, unstructured clinical notes are a valuable source of information containing regular assessments of the patient's condition but contain inconsistent abbreviations and lack the structure of formal documents. Our contributions in this paper are twofold. First, we present a new preprocessing technique for extracting features from informal clinical notes that can be used in a classification model to identify patients at risk of developing complications. Second, we explore the use of collective matrix factorization, a multi-view learning technique, to model heterogeneous clinical data text-based features in combination with other measurements, such as clinical investigations, comor-bidites, and demographic data. We present a detailed case study on postoperative respiratory failure using more than 700 patient records from the MIMIC II database. Our experiments demonstrate the efficacy of our preprocessing technique in extracting discriminatory features from clinical notes as well as the benefits of multi-view learning to combine clinical measurements with text data for predicting complications.
引用
收藏
页码:7988 / 8001
页数:14
相关论文
共 66 条
[1]  
Archambeau C., 2006, P 23 INT C MACHINE L, P33, DOI DOI 10.1145/1143844.1143849
[2]   Multifactorial risk index for predicting postoperative respiratory failure in men after major noncardiac surgery [J].
Arozullah, AM ;
Daley, J ;
Henderson, WG ;
Khuri, SF .
ANNALS OF SURGERY, 2000, 232 (02) :242-253
[3]   Making Big Data Useful for Health Care: A Summary of the Inaugural MIT Critical Data Conference [J].
Badawi, Omar ;
Brennan, Thomas ;
Celi, Leo Anthony ;
Feng, Mengling ;
Ghassemi, Marzyeh ;
Ippolito, Andrea ;
Johnson, Alistair ;
Mark, Roger G. ;
Mayaud, Louis ;
Moody, George ;
Moses, Christopher ;
Naumann, Tristan ;
Nikore, Vipan ;
Pimentel, Marco ;
Pollard, Tom J. ;
Santos, Mauro ;
Stone, David J. ;
Zimolzak, Andrew .
JMIR MEDICAL INFORMATICS, 2014, 2 (02) :41-51
[4]   A Pattern Mining Approach for Classifying Multivariate Temporal Data [J].
Batal, Iyad ;
Valizadegan, Hamed ;
Cooper, Gregory F. ;
Hauskrecht, Milos .
2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, :358-365
[5]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[6]   Preoperative and Intraoperative Predictors of Postoperative Acute Respiratory Distress Syndrome in a General Surgical Population [J].
Blum, James M. ;
Stentz, Michael J. ;
Dechert, Ronald ;
Jewell, Elizabeth ;
Engoren, Milo ;
Rosenberg, Andrew L. ;
Park, Pauline K. .
ANESTHESIOLOGY, 2013, 118 (01) :19-29
[7]   Development and Validation of a Score for Prediction of Postoperative Respiratory Complications [J].
Brueckmann, Britta ;
Villa-Uribe, Jose L. ;
Bateman, Brian T. ;
Rosse-Sundrup, Martina G. ;
Hess, Dean R. ;
Schlett, Christopher L. ;
Eikermann, Matthias .
ANESTHESIOLOGY, 2013, 118 (06) :1276-1285
[8]   Dynamically Modeling Patient's Health State from Electronic Medical Records: A Time Series Approach [J].
Caballero, Karla ;
Akella, Ram .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :69-78
[9]   Postoperative respiratory failure: pathogenesis, prediction, and prevention [J].
Canet, Jaume ;
Gallart, Lluis .
CURRENT OPINION IN CRITICAL CARE, 2014, 20 (01) :56-62
[10]   Prediction of Postoperative Pulmonary Complications in a Population-based Surgical Cohort [J].
Canet, Jaume ;
Gallart, Lluis ;
Gomar, Carmen ;
Paluzie, Guillem ;
Valles, Jordi ;
Castillo, Jordi ;
Sabate, Sergi ;
Mazo, Valentin ;
Briones, Zahara ;
Sanchis, Joaquin .
ANESTHESIOLOGY, 2010, 113 (06) :1338-1350