MediBoost: a Patient Stratification Tool for Interpretable Decision Making in the Era of Precision Medicine

被引:0
作者
Gilmer Valdes
José Marcio Luna
Eric Eaton
Charles B. Simone
Lyle H. Ungar
Timothy D. Solberg
机构
[1] University of California,Radiation Oncology Department
[2] San Francisco,Department of Radiation Oncology
[3] Perelman Center for Advance Medicine,Department of Computer and Information Science
[4] University of Pennsylvania,undefined
[5] University of Pennsylvania,undefined
来源
Scientific Reports | / 6卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning algorithms that are both interpretable and accurate are essential in applications such as medicine where errors can have a dire consequence. Unfortunately, there is currently a tradeoff between accuracy and interpretability among state-of-the-art methods. Decision trees are interpretable and are therefore used extensively throughout medicine for stratifying patients. Current decision tree algorithms, however, are consistently outperformed in accuracy by other, less-interpretable machine learning models, such as ensemble methods. We present MediBoost, a novel framework for constructing decision trees that retain interpretability while having accuracy similar to ensemble methods, and compare MediBoost’s performance to that of conventional decision trees and ensemble methods on 13 medical classification problems. MediBoost significantly outperformed current decision tree algorithms in 11 out of 13 problems, giving accuracy comparable to ensemble methods. The resulting trees are of the same type as decision trees used throughout clinical practice but have the advantage of improved accuracy. Our algorithm thus gives the best of both worlds: it grows a single, highly interpretable tree that has the high accuracy of ensemble methods.
引用
收藏
相关论文
共 47 条
  • [1] Baumann M(2016)Radiation oncology in the era of precision medicine Nat Rev Cancer 16 234-249
  • [2] Trusheim MR(2007)Stratified medicine: strategic and economic implications of combining drugs and clinical biomarkers Nat Rev Drug Discov 6 287-293
  • [3] Berndt ER(2016)Risk literacy in medical decision-making Science 352 413-414
  • [4] Douglas FL(2015)Patient-centric trials for therapeutic development in precision oncology Nature 526 361-370
  • [5] Operskalski JT(2001)Validation of clinical classification schemes for predicting stroke: results from the National Registry of Atrial Fibrillation Jama 285 2864-2870
  • [6] Barbey AK(2000)The TIMI risk score for unstable angina/non-ST elevation MI: A method for prognostication and therapeutic decision making Jama 284 835-842
  • [7] Biankin AV(2003)Defining community acquired pneumonia severity on presentation to hospital: an international derivation and validation study Thorax 58 377-382
  • [8] Piantadosi S(1975)Precursors of sudden coronary death. Factors related to the incidence of sudden death Circulation 51 606-613
  • [9] Hollingsworth SJ(2014)Introduction of gluten, HLA status, and the risk of celiac disease in children N Engl J Med 371 1295-1303
  • [10] Gage BF(2014)A randomized trial of bevacizumab for newly diagnosed glioblastoma N Engl J Med 370 699-708