Bias-variance decomposition of absolute errors for diagnosing regression models of continuous data

被引:7
作者
Gao, Jing [1 ,2 ]
机构
[1] Univ Delaware, Data Sci Inst, Newark, DE 19716 USA
[2] Univ Delaware, Dept Geog & Spatial Sci, Newark, DE 19716 USA
来源
PATTERNS | 2021年 / 2卷 / 08期
关键词
absolute error; bias-variance decomposition; continuous data; continuous response; data-driven modeling; DSML 3: Development/pre-production: Data science output has been rolled out/validated across multiple domains/problems; empirical models; inductive learning; L1; loss; regressions;
D O I
10.1016/j.patter.2021.100309
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bias-variance decomposition (BVD) is a powerful tool for understanding and improving data-driven models. It reveals sources of estimation errors. Existing literature has defined BVD for squared error but not absolute error, while absolute error is the more natural error metric and has shown advantages over squared error in many scientific fields. Here, I analytically derive the absolute-error BVD, empirically investigate its behaviors, and compare that with other error metrics. Different error metrics offer distinctly different perspectives. I find the commonly believed bias/variance trade-off under squared error is often absent under absolute error, and ensembles-a never hurt technique under squared error-could harm performance under absolute error. Compared with squared error, absolute-error BVD better promotes model traits reducing estimation residuals and better illustrates relative importance of different error sources. As data scientists pay increasing attention to uncertainty issues, the technique introduced here can be a useful addition to a data-driven modeler's toolset.
引用
收藏
页数:9
相关论文
共 9 条
[1]  
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1007/BF00058655
[2]  
BREIMAN L, 1996, Bias, variance, and arcing classifiers
[3]  
Domingos P., 2000, A Unifeid Bias-Variance Decomposition and its Applications, P231
[4]   Per-pixel bias-variance decomposition of continuous errors in data-driven geospatial modeling: A case study in environmental remote sensing [J].
Gao, Jing ;
Burt, James E. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 134 :110-121
[5]   NEURAL NETWORKS AND THE BIAS VARIANCE DILEMMA [J].
GEMAN, S ;
BIENENSTOCK, E ;
DOURSAT, R .
NEURAL COMPUTATION, 1992, 4 (01) :1-58
[6]   Variance and bias for general loss functions [J].
James, GM .
MACHINE LEARNING, 2003, 51 (02) :115-135
[7]  
Schapire RE, 1998, ANN STAT, V26, P1651
[8]   Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance [J].
Willmott, CJ ;
Matsuura, K .
CLIMATE RESEARCH, 2005, 30 (01) :79-82
[9]   Ambiguities inherent in sums-of-squares-based error statistics [J].
Willmott, Cort J. ;
Matsuura, Kenji ;
Robeson, Scott M. .
ATMOSPHERIC ENVIRONMENT, 2009, 43 (03) :749-752