Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods

被引:760
作者
Huellermeier, Eyke [1 ,2 ]
Waegeman, Willem [3 ]
机构
[1] Paderborn Univ, Heinz Nixdorf Inst, Paderborn, Germany
[2] Paderborn Univ, Dept Comp Sci, Paderborn, Germany
[3] Univ Ghent, Dept Math Modelling Stat & Bioinformat, Ghent, Belgium
关键词
Uncertainty; Probability; Epistemic uncertainty; Version space learning; Bayesian inference; Calibration; Ensembles; Gaussian processes; Deep neural networks; Likelihood-based methods; Credal sets and classifiers; Conformal prediction; Set-valued prediction; Generative models; PROBABILITY; CLASSIFICATION; SETS; CLASSIFIERS; RULE;
D O I
10.1007/s10994-021-05946-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The notion of uncertainty is of major importance in machine learning and constitutes a key element of machine learning methodology. In line with the statistical tradition, uncertainty has long been perceived as almost synonymous with standard probability and probabilistic predictions. Yet, due to the steadily increasing relevance of machine learning for practical applications and related issues such as safety requirements, new problems and challenges have recently been identified by machine learning scholars, and these problems may call for new methodological developments. In particular, this includes the importance of distinguishing between (at least) two different types of uncertainty, often referred to as aleatoric and epistemic. In this paper, we provide an introduction to the topic of uncertainty in machine learning as well as an overview of attempts so far at handling uncertainty in general and formalizing this distinction in particular.
引用
收藏
页码:457 / 506
页数:50
相关论文
共 121 条
  • [1] Disaggregated total uncertainty measure for credal sets
    Abellán, J
    Klir, GJ
    Moral, S
    [J]. INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2006, 35 (01) : 29 - 44
  • [2] A non-specificity measure for convex sets of probability distributions
    Abellan, Joaquin
    Moral, Serafin
    [J]. International Journal of Uncertainty, Fuzziness and Knowlege-Based Systems, 2000, 8 (03): : 357 - 367
  • [3] Agarwal S., 2015, ARXIVABS150504137 CO
  • [4] [Anonymous], 1970, Probability Theory
  • [5] [Anonymous], 2004, Science from Fisher Information: A Unification
  • [6] Antonucci A., 2012, P 6 EUROPEAN WORKSHO, P3
  • [7] Balasubramanian Vineeth, 2014, Conformal Prediction for Reliable Machine Learning
  • [8] The limits of distribution-free conditional predictive inference
    Barber, Rina Foygel
    Candes, Emmanuel J.
    Ramdas, Aaditya
    Tibshirani, Ryan J.
    [J]. INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2021, 10 (02) : 455 - 482
  • [9] Bazargami M, 2019, EUR C MACH LEARN KNO
  • [10] An introduction to the imprecise Dirichlet model for multinomial data
    Bernard, JM
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2005, 39 (2-3) : 123 - 150