A review of predictive uncertainty estimation with machine learning

被引:35
作者
Tyralis, Hristos [1 ,2 ]
Papacharalampous, Georgia [1 ]
机构
[1] Natl Tech Univ Athens, Sch Rural Surveying & Geoinformat Engn, Dept Topog, Iroon Polytech 5, Zografos 15780, Greece
[2] Hellen Air Force, Construct Agcy, Mesogion Ave 227-231, Cholargos 15561, Greece
关键词
Boosting; Deep learning; Distributional regression; Ensemble learning; Machine learning; Probabilistic forecasting; Quantile regression; Random forests; PROPER SCORING RULES; REGRESSION CONFORMAL PREDICTION; RANKED PROBABILITY SCORE; QUANTILE REGRESSION; NEURAL-NETWORKS; DISTRIBUTIONAL REGRESSION; CONDITIONAL DENSITY; BAYESIAN-INFERENCE; ADDITIVE-MODELS; FORECAST VERIFICATION;
D O I
10.1007/s10462-023-10698-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predictions and forecasts of machine learning models should take the form of probability distributions, aiming to increase the quantity of information communicated to end users. Although applications of probabilistic prediction and forecasting with machine learning models in academia and industry are becoming more frequent, related concepts and methods have not been formalized and structured under a holistic view of the entire field. Here, we review the topic of predictive uncertainty estimation with machine learning algorithms, as well as the related metrics (consistent scoring functions and proper scoring rules) for assessing probabilistic predictions. The review covers a time period spanning from the introduction of early statistical (linear regression and time series models, based on Bayesian statistics or quantile regression) to recent machine learning algorithms (including generalized additive models for location, scale and shape, random forests, boosting and deep learning algorithms) that are more flexible by nature. The review of the progress in the field, expedites our understanding on how to develop new algorithms tailored to users' needs, since the latest advancements are based on some fundamental concepts applied to more complex algorithms. We conclude by classifying the material and discussing challenges that are becoming a hot topic of research.
引用
收藏
页数:65
相关论文
共 486 条
[31]   Practical Heteroscedastic Gaussian Process Modeling for Large Simulation Experiments [J].
Binois, Mickael ;
Gramacy, Robert B. ;
Ludkovski, Mike .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2018, 27 (04) :808-821
[32]   An introduction to multivariate probabilistic forecast evaluation [J].
Bjerregard, Mathias Blicher ;
Moller, Jan Kloppenborg ;
Madsen, Henrik .
ENERGY AND AI, 2021, 4
[33]   Conformal prediction using random survival forests [J].
Bostrom, Henrik ;
Asker, Lars ;
Gurung, Ram ;
Karlsson, Isak ;
Lindgren, Tony ;
Papapetrou, Panagiotis .
2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, :812-817
[34]  
BRECKLING J, 1988, BIOMETRIKA, V75, P761
[35]   Scoring interval forecasts: Equal-tailed, shortest, and modal interval [J].
Brehmer, Jonas R. ;
Gneiting, Tilmann .
BERNOULLI, 2021, 27 (03) :1993-2010
[36]   Properization: constructing proper scoring rules via Bayes acts [J].
Brehmer, Jonas R. ;
Gneiting, Tilmann .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2020, 72 (03) :659-673
[37]   Why scoring functions cannot assess tail properties [J].
Brehmer, Jonas R. ;
Strokorb, Kirstin .
ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (02) :4015-4034
[38]   SEDE-GPS: socio-economic data enrichment based on GPS information [J].
Sperlea, Theodor ;
Fueser, Stefan ;
Boenigk, Jens ;
Heider, Dominik .
BMC BIOINFORMATICS, 2018, 19
[39]   Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215
[40]  
Brier G.W., 1950, MONTHLY WEATHER REV, V78, P1, DOI [DOI 10.1175/1520-0493(1950)078ANDLT