A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data

被引：7

作者：

Smith, Hayley ^{[1
]}

Sweeting, Michael ^{[1
,2
]}

Morris, Tim ^{[3
]}

Crowther, Michael ^{[4
]}

机构：

[1] Univ Leicester, Dept Hlth Sci, Leicester LE1 7RH, England

[2] Stat Innovat, Oncol Biometr, Oncol R&D, AstraZeneca, Cambridge, England

[3] UCL, MRC Clin Trials Unit, 90 High Holborn, London WC1V 6LJ, England

[4] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden

来源：

DIAGNOSTIC AND PROGNOSTIC RESEARCH | 2022年 / 6卷 / 01期

关键词：

Machine learning; Prognostic modelling; Clinical risk prediction; Survival analysis; Simulation studies; CENSORED SURVIVAL-DATA; FRACTIONAL POLYNOMIALS; CARDIOVASCULAR RISK; COX REGRESSION; MODELS; PROGNOSIS; DESIGN;

D O I：

10.1186/s41512-022-00124-y

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Background There is substantial interest in the adaptation and application of so-called machine learning approaches to prognostic modelling of censored time-to-event data. These methods must be compared and evaluated against existing methods in a variety of scenarios to determine their predictive performance. A scoping review of how machine learning methods have been compared to traditional survival models is important to identify the comparisons that have been made and issues where they are lacking, biased towards one approach or misleading.Methods We conducted a scoping review of research articles published between 1 January 2000 and 2 December 2020 using PubMed. Eligible articles were those that used simulation studies to compare statistical and machine learning methods for risk prediction with a time-to-event outcome in a medical/healthcare setting. We focus on data-generating mechanisms (DGMs), the methods that have been compared, the estimands of the simulation studies, and the performance measures used to evaluate them.Results A total of ten articles were identified as eligible for the review. Six of the articles evaluated a method that was developed by the authors, four of which were machine learning methods, and the results almost always stated that this developed method's performance was equivalent to or better than the other methods compared. Comparisons were often biased towards the novel approach, with the majority only comparing against a basic Cox proportional hazards model, and in scenarios where it is clear it would not perform well. In many of the articles reviewed, key information was unclear, such as the number of simulation repetitions and how performance measures were calculated.Conclusion It is vital that method comparisons are unbiased and comprehensive, and this should be the goal even if realising it is difficult. Fully assessing how newly developed methods perform and how they compare to a variety of traditional statistical methods for prognostic modelling is imperative as these methods are already being applied in clinical contexts. Evaluations of the performance and usefulness of recently developed methods for risk prediction should be continued and reporting standards improved as these methods become increasingly popular.

引用

页数：15

共 59 条

[1] Joint estimation of time-dependent and non-linear effects of continuous covariates on survival [J].