The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation

Cited by: 2320
Authors
Chicco, Davide [1]
Warrens, Matthijs J. [2]
Jurman, Giuseppe [3]
Affiliations
[1] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON, Canada
[2] Univ Groningen, Groningen Inst Educ Res, Groningen, Netherlands
[3] Fdn Bruno Kessler, Data Sci Hlth Unit, Trento, Italy
Keywords
Regression; Regression evaluation; Regression evaluation rates; Coefficient of determination; Mean square error; Mean absolute error; Regression analysis; ABSOLUTE ERROR MAE; SAMPLE-SIZE; APPROXIMATION; ACCURACY; R-2
DOI
10.7717/peerj-cs.623
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Regression analysis makes up a large part of supervised machine learning, and consists of the prediction of a continuous independent target from a set of other predictor variables. The difference between binary classification and regression is in the target range: in binary classification, the target can have only two values (usually encoded as 0 and 1), while in regression the target can have multiple values. Even if regression analysis has been employed in a huge number of machine learning studies, no consensus has been reached on a single, unified, standard metric to assess the results of the regression itself. Many studies employ the mean square error (MSE) and its rooted variant (RMSE), or the mean absolute error (MAE) and its percentage variant (MAPE). Although useful, these rates share a common drawback: since their values can range between zero and +infinity, a single value of them does not say much about the performance of the regression with respect to the distribution of the ground truth elements. In this study, we focus on two rates that actually generate a high score only if the majority of the elements of a ground truth group has been correctly predicted: the coefficient of determination (also known as R-squared or R²) and the symmetric mean absolute percentage error (SMAPE). After showing their mathematical properties, we report a comparison between R² and SMAPE in several use cases and in two real medical scenarios. Our results demonstrate that the coefficient of determination (R-squared) is more informative and truthful than SMAPE, and does not have the interpretability limitations of MSE, RMSE, MAE and MAPE. We therefore suggest the usage of R-squared as standard metric to evaluate regression analyses in any scientific domain.
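The six rates named in the abstract can be illustrated with a minimal Python sketch. The abstract does not state the formulas, so the definitions below are assumptions: they are the commonly used textbook forms, and SMAPE in particular has several variants in the literature (here the mean of the absolute values is used in the denominator, giving a range of 0% to 200%). The function name `regression_metrics` is hypothetical, and the sketch assumes no actual value is zero and the actual values are not all identical.

```python
import math


def regression_metrics(actual, predicted):
    """Compute MAE, MSE, RMSE, MAPE, SMAPE and R-squared.

    Assumed definitions (not taken from the abstract):
      MAPE  = 100/m * sum(|a - p| / |a|)
      SMAPE = 100/m * sum(|a - p| / ((|a| + |p|) / 2))
      R^2   = 1 - sum((a - p)^2) / sum((a - mean(a))^2)
    """
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    m = len(actual)
    errors = [a - p for a, p in zip(actual, predicted)]

    mae = sum(abs(e) for e in errors) / m
    mse = sum(e * e for e in errors) / m
    rmse = math.sqrt(mse)

    # Undefined if any actual value is zero (assumed not to happen here).
    mape = 100.0 / m * sum(abs(e) / abs(a) for e, a in zip(errors, actual))

    # One common SMAPE variant; bounded between 0 and 200 percent.
    smape = 100.0 / m * sum(
        abs(e) / ((abs(a) + abs(p)) / 2.0)
        for e, a, p in zip(errors, actual, predicted)
    )

    # R-squared compares the model against always predicting the mean.
    mean_actual = sum(actual) / m
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    r2 = 1.0 - ss_res / ss_tot

    return {"MAE": mae, "MSE": mse, "RMSE": rmse,
            "MAPE": mape, "SMAPE": smape, "R2": r2}


# Same relative fit on two scales: the unbounded rates change with the
# units of the target, while R-squared stays the same.
small = regression_metrics([1.0, 2.0, 3.0, 4.0], [1.2, 1.8, 3.3, 3.9])
large = regression_metrics([10.0, 20.0, 30.0, 40.0], [12.0, 18.0, 33.0, 39.0])
print(small)
print(large)
```

Under these assumed definitions, rescaling the target changes MAE, MSE and RMSE but leaves R-squared unchanged, which is one way to read the abstract's remark that a single MSE or MAE value says little without reference to the distribution of the ground truth.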
Pages: 24