Validating a Machine Learning Algorithm to Predict 30-Day Re-Admissions in Patients With Heart Failure: Protocol for a Prospective Cohort Study

被引:8
作者
Kakarmath, Sujay [1 ,2 ,3 ]
Golas, Sara [1 ]
Felsted, Jennifer [1 ,3 ]
Kvedar, Joseph [2 ,3 ,4 ]
Jethwani, Kamal [2 ,3 ,4 ]
Agboola, Stephen [1 ,2 ,3 ]
机构
[1] Partners HealthCare, Partners Connected Hlth, Data Sci & Analyt Unit, 25 New Chardon St,Suite 300, Boston, MA 02114 USA
[2] Massachusetts Gen Hosp, Dept Dermatol, Boston, MA 02114 USA
[3] Harvard Med Sch, Dept Dermatol, Boston, MA USA
[4] Partners HealthCare, Partners Connected Hlth, Boston, MA 02114 USA
来源
JMIR RESEARCH PROTOCOLS | 2018年 / 7卷 / 09期
关键词
electronic medical records; heart failure; machine learning; predictive algorithms; readmissions;
D O I
10.2196/resprot.9466
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Big data solutions, particularly machine learning predictive algorithms, have demonstrated the ability to unlock value from data in real time in many settings outside of health care. Rapid growth in electronic medical record adoption and the shift from a volume based to a value-based reimbursement structure in the US health care system has spurred investments in machine learning solutions. Machine learning methods can be used to build flexible, customized, and automated predictive models to optimize resource allocation and improve the efficiency and quality of health care. However, these models are prone to the problems of overfitting, confounding, and decay in predictive performance over time. It is, therefore, necessary to evaluate machine learning-based predictive models in an independent dataset before they can be adopted in the clinical practice. In this paper, we describe the protocol for independent, prospective validation of a machine learning-based model trained to predict the risk of 30-day re-admission in patients with heart failure. Objective: This study aims to prospectively validate a machine learning-based predictive model for inpatient admissions in patients with heart failure by comparing its predictions of risk for 30-day re-admissions against outcomes observed prospectively in an independent patient cohort. Methods: All adult patients with heart failure who are discharged alive from an inpatient admission will be prospectively monitored for 30-day re-admissions through reports generated by the electronic medical record system. Of these, patients who are part of the training dataset will be excluded to avoid information leakage to the algorithm. An expected sample size of 1228 index admissions will be required to observe a minimum of 100 30-day re-admission events. Deidentified structured and unstructured data will be fed to the algorithm, and its prediction will be recorded. The overall model performance will be assessed using the concordance statistic. Furthermore, multiple discrimination thresholds for screening high-risk patients will be evaluated according to the sensitivity, specificity, predictive values, and estimated cost savings to our health care system. Results: The project received funding in April 2017 and data collection began in June 2017. Enrollment was completed in July 2017. Data analysis is currently underway, and the first results are expected to be submitted for publication in October 2018. Conclusions: To the best of our knowledge, this is one of the first studies to prospectively evaluate a predictive machine learning algorithm in a real-world setting. Findings from this study will help to measure the robustness of predictions made by machine learning algorithms and set a realistic benchmark for expectations of gains that can be made through its application to health care.
引用
收藏
页数:7
相关论文
共 26 条
  • [1] Hospital Readmission as an Accountability Measure
    Axon, R. Neal
    Williams, Mark V.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 305 (05): : 504 - 505
  • [2] Data-Driven Decisions for Reducing Readmissions for Heart Failure: General Methodology and Case Study
    Bayati, Mohsen
    Braverman, Mark
    Gillam, Michael
    Mack, Karen M.
    Ruiz, George
    Smith, Mark S.
    Horvitz, Eric
    [J]. PLOS ONE, 2014, 9 (10):
  • [3] Contemporary Evidence About Hospital Strategies for Reducing 30-Day Readmissions A National Study
    Bradley, Elizabeth H.
    Curry, Leslie
    Horwitz, Leora I.
    Sipsma, Heather
    Thompson, Jennifer W.
    Elma, MaryAnne
    Walsh, Mary Norine
    Krumholz, Harlan M.
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2012, 60 (07) : 607 - 614
  • [4] Machine Learning and Prediction in Medicine - Beyond the Peak of Inflated Expectations
    Chen, Jonathan H.
    Asch, Steven M.
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2017, 376 (26) : 2507 - 2509
  • [5] Decaying relevance of clinical data towards future decisions in data-driven inpatient clinical order sets
    Chen, Jonathan H.
    Alagappan, Muthuraman
    Goldstein, Mary K.
    Asch, Steven M.
    Altman, Russ B.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2017, 102 : 71 - 79
  • [6] CMS, 2016, CALCULATED 2018 VALU
  • [7] Sample size considerations for the external validation of a multivariable prognostic model: a resampling study
    Collins, Gary S.
    Ogundimu, Emmanuel O.
    Altman, Douglas G.
    [J]. STATISTICS IN MEDICINE, 2016, 35 (02) : 214 - 226
  • [8] Statistical evaluation of prognostic versus diagnostic models: Beyond the ROC curve
    Cook, Nancy R.
    [J]. CLINICAL CHEMISTRY, 2008, 54 (01) : 17 - 23
  • [9] Value and Limitations of Existing Scores for the Assessment of Cardiovascular Risk A Review for Clinicians
    Cooney, Marie Therese
    Dudina, Alexandra L.
    Graham, Ian M.
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2009, 54 (14) : 1209 - 1227
  • [10] Barriers to apply cardiovascular prediction rules in primary care: a postal survey
    Eichler, Klaus
    Zoller, Marco
    Tschudi, Peter
    Steurer, Johann
    [J]. BMC FAMILY PRACTICE, 2007, 8 (1)