Predicting Complications in Critical Care Using Heterogeneous Clinical Data

Times cited: 28
Authors
Huddar, Vijay [1]
Desiraju, Bapu Koundinya [2]
Rajan, Vaibhav [1]
Bhattacharya, Sakyajit [1]
Roy, Shourya [1]
Reddy, Chandan K. [3]
Affiliations
[1] Xerox Res Ctr India, Bangalore 560103, Karnataka, India
[2] Inst Genom & Integrat Biol, New Delhi 110025, India
[3] Virginia Tech, Dept Comp Sci, Arlington, VA 22203 USA
Funding
US National Science Foundation
Keywords
Clinical notes; topic models; heterogeneous data; multi-view learning; collective matrix factorization; postoperative respiratory failure; POSTOPERATIVE RESPIRATORY-FAILURE; AUTOMATED IDENTIFICATION; PULMONARY COMPLICATIONS; RISK; SURGERY; VALIDATION; INTUBATION; EVENTS; SCORE;
DOI
10.1109/ACCESS.2016.2618775
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Patients in hospitals, particularly in critical care, are susceptible to many complications affecting morbidity and mortality. Digitized clinical data in electronic medical records can be effectively used to develop machine learning models to identify patients at risk of complications early and provide prioritized care to prevent complications. However, clinical data from heterogeneous sources within hospitals pose significant modeling challenges. In particular, unstructured clinical notes are a valuable source of information containing regular assessments of the patient's condition, but they contain inconsistent abbreviations and lack the structure of formal documents. Our contributions in this paper are twofold. First, we present a new preprocessing technique for extracting features from informal clinical notes that can be used in a classification model to identify patients at risk of developing complications. Second, we explore the use of collective matrix factorization, a multi-view learning technique, to model heterogeneous clinical data: text-based features in combination with other measurements, such as clinical investigations, comorbidities, and demographic data. We present a detailed case study on postoperative respiratory failure using more than 700 patient records from the MIMIC II database. Our experiments demonstrate the efficacy of our preprocessing technique in extracting discriminatory features from clinical notes as well as the benefits of multi-view learning to combine clinical measurements with text data for predicting complications.
Pages: 7988-8001
Number of pages: 14