Validating a Machine Learning Algorithm to Predict 30-Day Re-Admissions in Patients With Heart Failure: Protocol for a Prospective Cohort Study

被引：8

作者：

Kakarmath, Sujay ^{[1
,2
,3
]}

Golas, Sara ^{[1
]}

Felsted, Jennifer ^{[1
,3
]}

Kvedar, Joseph ^{[2
,3
,4
]}

Jethwani, Kamal ^{[2
,3
,4
]}

Agboola, Stephen ^{[1
,2
,3
]}

机构：

[1] Partners HealthCare, Partners Connected Hlth, Data Sci & Analyt Unit, 25 New Chardon St,Suite 300, Boston, MA 02114 USA

[2] Massachusetts Gen Hosp, Dept Dermatol, Boston, MA 02114 USA

[3] Harvard Med Sch, Dept Dermatol, Boston, MA USA

[4] Partners HealthCare, Partners Connected Hlth, Boston, MA 02114 USA

来源：

JMIR RESEARCH PROTOCOLS | 2018年 / 7卷 / 09期

关键词：

electronic medical records; heart failure; machine learning; predictive algorithms; readmissions;

D O I：

10.2196/resprot.9466

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Background: Big data solutions, particularly machine learning predictive algorithms, have demonstrated the ability to unlock value from data in real time in many settings outside of health care. Rapid growth in electronic medical record adoption and the shift from a volume based to a value-based reimbursement structure in the US health care system has spurred investments in machine learning solutions. Machine learning methods can be used to build flexible, customized, and automated predictive models to optimize resource allocation and improve the efficiency and quality of health care. However, these models are prone to the problems of overfitting, confounding, and decay in predictive performance over time. It is, therefore, necessary to evaluate machine learning-based predictive models in an independent dataset before they can be adopted in the clinical practice. In this paper, we describe the protocol for independent, prospective validation of a machine learning-based model trained to predict the risk of 30-day re-admission in patients with heart failure. Objective: This study aims to prospectively validate a machine learning-based predictive model for inpatient admissions in patients with heart failure by comparing its predictions of risk for 30-day re-admissions against outcomes observed prospectively in an independent patient cohort. Methods: All adult patients with heart failure who are discharged alive from an inpatient admission will be prospectively monitored for 30-day re-admissions through reports generated by the electronic medical record system. Of these, patients who are part of the training dataset will be excluded to avoid information leakage to the algorithm. An expected sample size of 1228 index admissions will be required to observe a minimum of 100 30-day re-admission events. Deidentified structured and unstructured data will be fed to the algorithm, and its prediction will be recorded. The overall model performance will be assessed using the concordance statistic. Furthermore, multiple discrimination thresholds for screening high-risk patients will be evaluated according to the sensitivity, specificity, predictive values, and estimated cost savings to our health care system. Results: The project received funding in April 2017 and data collection began in June 2017. Enrollment was completed in July 2017. Data analysis is currently underway, and the first results are expected to be submitted for publication in October 2018. Conclusions: To the best of our knowledge, this is one of the first studies to prospectively evaluate a predictive machine learning algorithm in a real-world setting. Findings from this study will help to measure the robustness of predictions made by machine learning algorithms and set a realistic benchmark for expectations of gains that can be made through its application to health care.

引用

页数：7

共 26 条

[1] Hospital Readmission as an Accountability Measure
Axon, R. Neal
Williams, Mark V.
[J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 305 (05): : 504 - 505
[2] Data-Driven Decisions for Reducing Readmissions for Heart Failure: General Methodology and Case Study
Bayati, Mohsen
Braverman, Mark
Gillam, Michael
Mack, Karen M.
Ruiz, George
Smith, Mark S.
Horvitz, Eric
[J]. PLOS ONE, 2014, 9 (10):
[3] Contemporary Evidence About Hospital Strategies for Reducing 30-Day Readmissions A National Study
Bradley, Elizabeth H.
Curry, Leslie
Horwitz, Leora I.
Sipsma, Heather
Thompson, Jennifer W.
Elma, MaryAnne
Walsh, Mary Norine
Krumholz, Harlan M.
[J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2012, 60 (07) : 607 - 614
[4] Machine Learning and Prediction in Medicine - Beyond the Peak of Inflated Expectations
Chen, Jonathan H.
Asch, Steven M.
[J]. NEW ENGLAND JOURNAL OF MEDICINE, 2017, 376 (26) : 2507 - 2509
[5] Decaying relevance of clinical data towards future decisions in data-driven inpatient clinical order sets
Chen, Jonathan H.
Alagappan, Muthuraman
Goldstein, Mary K.
Asch, Steven M.
Altman, Russ B.
[J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2017, 102 : 71 - 79
[6] CMS, 2016, CALCULATED 2018 VALU
[7] Sample size considerations for the external validation of a multivariable prognostic model: a resampling study
Collins, Gary S.
Ogundimu, Emmanuel O.
Altman, Douglas G.
[J]. STATISTICS IN MEDICINE, 2016, 35 (02) : 214 - 226
[8] Statistical evaluation of prognostic versus diagnostic models: Beyond the ROC curve
Cook, Nancy R.
[J]. CLINICAL CHEMISTRY, 2008, 54 (01) : 17 - 23
[9] Value and Limitations of Existing Scores for the Assessment of Cardiovascular Risk A Review for Clinicians
Cooney, Marie Therese
Dudina, Alexandra L.
Graham, Ian M.
[J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2009, 54 (14) : 1209 - 1227
[10] Barriers to apply cardiovascular prediction rules in primary care: a postal survey
Eichler, Klaus
Zoller, Marco
Tschudi, Peter
Steurer, Johann
[J]. BMC FAMILY PRACTICE, 2007, 8 (1)

← 1 2 3 →