How to measure uncertainty in uncertainty sampling for active learning

被引：0

作者：

Vu-Linh Nguyen

Mohammad Hossein Shaker

Eyke Hüllermeier

机构：

[1] Eindhoven University of Technology,Department of Mathematics and Computer Science

[2] Paderborn University,Heinz Nixdorf Institute and Department of Computer Science

来源：

Machine Learning | 2022年 / 111卷

关键词：

Active learning; Uncertainty sampling; Credal uncertainty; Epistemic uncertainty; Aleatoric uncertainty;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Various strategies for active learning have been proposed in the machine learning literature. In uncertainty sampling, which is among the most popular approaches, the active learner sequentially queries the label of those instances for which its current prediction is maximally uncertain. The predictions as well as the measures used to quantify the degree of uncertainty, such as entropy, are traditionally of a probabilistic nature. Yet, alternative approaches to capturing uncertainty in machine learning, alongside with corresponding uncertainty measures, have been proposed in recent years. In particular, some of these measures seek to distinguish different sources and to separate different types of uncertainty, such as the reducible (epistemic) and the irreducible (aleatoric) part of the total uncertainty in a prediction. The goal of this paper is to elaborate on the usefulness of such measures for uncertainty sampling, and to compare their performance in active learning. To this end, we instantiate uncertainty sampling with different measures, analyze the properties of the sampling strategies thus obtained, and compare them in an experimental study.

引用

页码：89 / 122

页数：33

共 45 条

[1]

Bernard JM(2005)An introduction to the imprecise Dirichlet model for multinomial data International Journal of Approximate Reasoning 39 123-150

[2]

Birnbaum A(1962)On the foundations of statistical inference Journal of the American Statistical Association 57 269-306

[3]

Bottou L(1992)Local learning algorithms Neural Computation 4 888-900

[4]

Vapnik V(2005)Active learning for Parzen window classifier Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics (AISTATS) 5 49-56

[5]

Chapelle O(1967)Nearest neighbor pattern classification IEEE Transactions on Information Theory 13 21-27

[6]

Cover T(1994)Probability intervals: A tool for uncertain reasoning International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 2 167-196

[7]

Hart P(2005)The elements of statistical learning: Data mining, inference and prediction The Mathematical Intelligencer 27 83-85

[8]

De Campos LM(1996)Aleatory and epistemic uncertainty in probability elicitation with an example from hazardous waste management Reliability Engineering and System Safety 54 217-223

[9]

Huete JF(2015)Optimised probabilistic active learning (OPAL) Machine Learning 100 449-476

[10]

Moral S(1996)A nearest neighbor bootstrap for resampling hydrologic time series Water Resources Research 32 679-693

← 1 2 3 4 5 →