Recurrent Neural Networks for Early Detection of Heart Failure From Longitudinal Electronic Health Record Data Implications for Temporal Modeling With Respect to Time Before Diagnosis, Data Density, Data Quantity, and Data Type

被引:40
作者
Chen, Robert [1 ,2 ]
Stewart, Walter F. [4 ]
Sun, Jimeng [2 ]
Ng, Kenney [3 ]
Yan, Xiaowei [1 ]
机构
[1] Sutter Hlth Res, Res, Walnut Creek, CA USA
[2] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
[3] IBM Res, TJ Watson Res Ctr, Ctr Computat Hlth, Yorktown Hts, NY USA
[4] Step2Works, Orinda, CA USA
来源
CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES | 2019年 / 12卷 / 10期
基金
美国国家科学基金会;
关键词
diagnosis; electronic health records; heart failure; machine learning; mortality; EVENTS;
D O I
10.1161/CIRCOUTCOMES.118.005114
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: We determined the impact of data volume and diversity and training conditions on recurrent neural network methods compared with traditional machine learning methods. Methods and Results: Using longitudinal electronic health record data, we assessed the relative performance of machine learning models trained to detect a future diagnosis of heart failure in primary care patients. Model performance was assessed in relation to data parameters defined by the combination of different data domains (data diversity), the number of patient records in the training data set (data quantity), the number of encounters per patient (data density), the prediction window length, and the observation window length (ie, the time period before the prediction window that is the source of features for prediction). Data on 4370 incident heart failure cases and 30 132 group-matched controls were used. Recurrent neural network model performance was superior under a variety of conditions that included (1) when data were less diverse (eg, a single data domain like medication or vital signs) given the same training size; (2) as data quantity increased; (3) as density increased; (4) as the observation window length increased; and (5) as the prediction window length decreased. When all data domains were used, the performance of recurrent neural network models increased in relation to the quantity of data used (ie, up to 100% of the data). When data are sparse (ie, fewer features or low dimension), model performance is lower, but a much smaller training set size is required to achieve optimal performance compared with conditions where data are more diverse and includes more features. Conclusions: Recurrent neural networks are effective for predicting a future diagnosis of heart failure given sufficient training set size. Model performance appears to continue to improve in direct relation to training set size.
引用
收藏
页数:15
相关论文
共 27 条
[1]   Risk Prediction Models for Mortality in Ambulatory Patients With Heart Failure A Systematic Review [J].
Alba, Ana C. ;
Agoritsas, Thomas ;
Jankowski, Milosz ;
Courvoisier, Delphine ;
Walter, Stephen D. ;
Guyatt, Gordon H. ;
Ross, Heather J. .
CIRCULATION-HEART FAILURE, 2013, 6 (05) :881-889
[2]  
[Anonymous], J PATIENT CENTERED R
[3]   A Study on Human Activity Recognition Using Accelerometer Data from Smartphones [J].
Bayat, Akram ;
Pomplun, Marc ;
Tran, Duc A. .
9TH INTERNATIONAL CONFERENCE ON FUTURE NETWORKS AND COMMUNICATIONS (FNC'14) / THE 11TH INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS AND PERVASIVE COMPUTING (MOBISPC'14) / AFFILIATED WORKSHOPS, 2014, 34 :450-457
[4]   Characterizing and Managing Missing Structured Data in Electronic Health Records: Data Analysis [J].
Beaulieu-Jones, Brett K. ;
Lavage, Daniel R. ;
Snyder, John W. ;
Moore, Jason H. ;
Pendergrass, Sarah A. ;
Bauer, Christopher R. .
JMIR MEDICAL INFORMATICS, 2018, 6 (01)
[5]  
Bishop C., 2006, Pattern Recognition and Machine Learning, P33
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Choi E, 2016, ADV NEUR IN, V29
[8]   Using recurrent neural network models for early detection of heart failure onset [J].
Choi, Edward ;
Schuetz, Andy ;
Stewart, Walter F. ;
Sun, Jimeng .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (02) :361-370
[9]  
Feng Z, 2015, 2015 37 ANN INT C IE
[10]   Contemporary Prevalence and Correlates of Incident Heart Failure with Preserved Ejection Fraction [J].
Gurwitz, Jerry H. ;
Magid, David J. ;
Smith, David H. ;
Goldberg, Robert J. ;
McManus, David D. ;
Allen, Larry A. ;
Saczynski, Jane S. ;
Thorp, Micah L. ;
Hsu, Grace ;
Sung, Sue Hee ;
Go, Alan S. .
AMERICAN JOURNAL OF MEDICINE, 2013, 126 (05) :393-400