A review of predictive uncertainty estimation with machine learning

被引:19
作者
Tyralis, Hristos [1 ,2 ]
Papacharalampous, Georgia [1 ]
机构
[1] Natl Tech Univ Athens, Sch Rural Surveying & Geoinformat Engn, Dept Topog, Iroon Polytech 5, Zografos 15780, Greece
[2] Hellen Air Force, Construct Agcy, Mesogion Ave 227-231, Cholargos 15561, Greece
关键词
Boosting; Deep learning; Distributional regression; Ensemble learning; Machine learning; Probabilistic forecasting; Quantile regression; Random forests; PROPER SCORING RULES; REGRESSION CONFORMAL PREDICTION; RANKED PROBABILITY SCORE; QUANTILE REGRESSION; NEURAL-NETWORKS; DISTRIBUTIONAL REGRESSION; CONDITIONAL DENSITY; BAYESIAN-INFERENCE; ADDITIVE-MODELS; FORECAST VERIFICATION;
D O I
10.1007/s10462-023-10698-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predictions and forecasts of machine learning models should take the form of probability distributions, aiming to increase the quantity of information communicated to end users. Although applications of probabilistic prediction and forecasting with machine learning models in academia and industry are becoming more frequent, related concepts and methods have not been formalized and structured under a holistic view of the entire field. Here, we review the topic of predictive uncertainty estimation with machine learning algorithms, as well as the related metrics (consistent scoring functions and proper scoring rules) for assessing probabilistic predictions. The review covers a time period spanning from the introduction of early statistical (linear regression and time series models, based on Bayesian statistics or quantile regression) to recent machine learning algorithms (including generalized additive models for location, scale and shape, random forests, boosting and deep learning algorithms) that are more flexible by nature. The review of the progress in the field, expedites our understanding on how to develop new algorithms tailored to users' needs, since the latest advancements are based on some fundamental concepts applied to more complex algorithms. We conclude by classifying the material and discussing challenges that are becoming a hot topic of research.
引用
收藏
页数:65
相关论文
共 489 条
  • [1] A review of uncertainty quantification in deep learning: Techniques, applications and challenges
    Abdar, Moloud
    Pourpanah, Farhad
    Hussain, Sadiq
    Rezazadegan, Dana
    Liu, Li
    Ghavamzadeh, Mohammad
    Fieguth, Paul
    Cao, Xiaochun
    Khosravi, Abbas
    Acharya, U. Rajendra
    Makarenkov, Vladimir
    Nahavandi, Saeid
    [J]. INFORMATION FUSION, 2021, 76 : 243 - 297
  • [2] Local polynomial expectile regression
    Adam, C.
    Gijbels, I.
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2022, 74 (02) : 341 - 378
  • [3] Data-driven probabilistic machine learning in sustainable smart energy/smart energy systems: Key developments, challenges, and future research opportunities in the context of smart grid paradigm
    Ahmad, Tanveer
    Madonski, Rafal
    Zhang, Dongdong
    Huang, Chao
    Mujeeb, Asad
    [J]. RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2022, 160
  • [4] Recognizing a spatial extreme dependence structure: A deep learning approach
    Ahmed, Manaf
    Maume-Deschamps, Veronique
    Ribereau, Pierre
    [J]. ENVIRONMETRICS, 2022, 33 (04)
  • [5] Scoring and Testing Procedures Devoted to Probabilistic Seismic Hazard Assessment
    Albarello, Dario
    D'Amico, Vera
    [J]. SURVEYS IN GEOPHYSICS, 2015, 36 (02) : 269 - 293
  • [6] An analog ensemble for short-term probabilistic solar power forecast
    Alessandrini, S.
    Delle Monache, L.
    Sperati, S.
    Cervone, G.
    [J]. APPLIED ENERGY, 2015, 157 : 95 - 110
  • [7] Alexandrov A, 2020, J MACH LEARN RES, V21
  • [8] A review and taxonomy of wind and solar energy forecasting methods based on deep learning
    Alkhayat, Ghadah
    Mehmood, Rashid
    [J]. ENERGY AND AI, 2021, 4
  • [9] Antoran J., 2020, Advances in neural information processing systems, V33, P10620
  • [10] ON THE COMPARISON OF INTERVAL FORECASTS
    Askanazi, Ross
    Diebold, Francis X.
    Schorfheide, Frank
    Shin, Minchul
    [J]. JOURNAL OF TIME SERIES ANALYSIS, 2018, 39 (06) : 953 - 965