A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data

被引:7
作者
Smith, Hayley [1 ]
Sweeting, Michael [1 ,2 ]
Morris, Tim [3 ]
Crowther, Michael [4 ]
机构
[1] Univ Leicester, Dept Hlth Sci, Leicester LE1 7RH, England
[2] Stat Innovat, Oncol Biometr, Oncol R&D, AstraZeneca, Cambridge, England
[3] UCL, MRC Clin Trials Unit, 90 High Holborn, London WC1V 6LJ, England
[4] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
关键词
Machine learning; Prognostic modelling; Clinical risk prediction; Survival analysis; Simulation studies; CENSORED SURVIVAL-DATA; FRACTIONAL POLYNOMIALS; CARDIOVASCULAR RISK; COX REGRESSION; MODELS; PROGNOSIS; DESIGN;
D O I
10.1186/s41512-022-00124-y
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background There is substantial interest in the adaptation and application of so-called machine learning approaches to prognostic modelling of censored time-to-event data. These methods must be compared and evaluated against existing methods in a variety of scenarios to determine their predictive performance. A scoping review of how machine learning methods have been compared to traditional survival models is important to identify the comparisons that have been made and issues where they are lacking, biased towards one approach or misleading.Methods We conducted a scoping review of research articles published between 1 January 2000 and 2 December 2020 using PubMed. Eligible articles were those that used simulation studies to compare statistical and machine learning methods for risk prediction with a time-to-event outcome in a medical/healthcare setting. We focus on data-generating mechanisms (DGMs), the methods that have been compared, the estimands of the simulation studies, and the performance measures used to evaluate them.Results A total of ten articles were identified as eligible for the review. Six of the articles evaluated a method that was developed by the authors, four of which were machine learning methods, and the results almost always stated that this developed method's performance was equivalent to or better than the other methods compared. Comparisons were often biased towards the novel approach, with the majority only comparing against a basic Cox proportional hazards model, and in scenarios where it is clear it would not perform well. In many of the articles reviewed, key information was unclear, such as the number of simulation repetitions and how performance measures were calculated.Conclusion It is vital that method comparisons are unbiased and comprehensive, and this should be the goal even if realising it is difficult. Fully assessing how newly developed methods perform and how they compare to a variety of traditional statistical methods for prognostic modelling is imperative as these methods are already being applied in clinical contexts. Evaluations of the performance and usefulness of recently developed methods for risk prediction should be continued and reporting standards improved as these methods become increasingly popular.
引用
收藏
页数:15
相关论文
共 59 条
[1]   Joint estimation of time-dependent and non-linear effects of continuous covariates on survival [J].
Abrahamowicz, Michal ;
MacKenzie, Todd A. .
STATISTICS IN MEDICINE, 2007, 26 (02) :392-408
[2]  
Alaa AM, 2019, Demystifying black-box models with symbolic metamodels
[3]   Reviewing the use and quality of machine learning in developing clinical prediction models for cardiovascular disease [J].
Allan, Simon ;
Olaiya, Raphael ;
Burhan, Rasan .
POSTGRADUATE MEDICAL JOURNAL, 2022, 98 (1161) :551-558
[4]   A time-dependent discrimination index for survival data [J].
Antolini, L ;
Boracchi, P ;
Biganzoli, E .
STATISTICS IN MEDICINE, 2005, 24 (24) :3927-3944
[5]   Predictive performance of machine and statistical learning methods: Impact of data-generating processes on external validity in the "large N, small p" setting [J].
Austin, Peter C. ;
Harrell, Frank E., Jr. ;
Steyerberg, Ewout W. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (06) :1465-1483
[6]   Dynamic Cox modelling based on fractional polynomials:: time-variations in gastric cancer prognosis [J].
Berger, U ;
Schäfer, J ;
Ulm, K .
STATISTICS IN MEDICINE, 2003, 22 (07) :1163-1180
[7]  
Binder Harald., 2013, R package version, 1
[8]   The c-index is not proper for the evaluation of -year predicted risks [J].
Blanche, Paul ;
Kattan, Michael W. ;
Gerds, Thomas A. .
BIOSTATISTICS, 2019, 20 (02) :347-357
[9]   Towards evidence-based computational statistics: lessons from clinical research on the role and design of real-data benchmark studies [J].
Boulesteix, Anne-Laure ;
Wilson, Rory ;
Hapfelmeier, Alexander .
BMC MEDICAL RESEARCH METHODOLOGY, 2017, 17
[10]   A Plea for Neutral Comparison Studies in Computational Sciences [J].
Boulesteix, Anne-Laure ;
Lauer, Sabine ;
Eugster, Manuel J. A. .
PLOS ONE, 2013, 8 (04)