Active learning algorithm through the lens of rejection arguments

被引:0
作者
Christophe Denis
Mohamed Hebiri
Boris Ndjia Njike
Xavier Siebert
机构
[1] Université Gustave Eiffel,LAMA
[2] University of Mons,Mathematics and Operational Research
来源
Machine Learning | 2024年 / 113卷
关键词
Active learning; Rejection; Nonparametric learning; Classification;
D O I
暂无
中图分类号
学科分类号
摘要
Active learning is a paradigm of machine learning which aims at reducing the amount of labeled data needed to train a classifier. Its overall principle is to sequentially select the most informative data points, which amounts to determining the uncertainty of regions of the input space. The main challenge lies in building a procedure that is computationally efficient and that offers appealing theoretical properties; most of the current methods satisfy only one or the other. In this paper, we use the classification with rejection in a novel way to estimate the uncertain regions. We provide an active learning algorithm and prove its theoretical benefits under classical assumptions. In addition to the theoretical results, numerical experiments are carried out on synthetic and non-synthetic datasets. These experiments provide empirical evidence that the use of rejection arguments in our active learning algorithm is beneficial and allows good performance in various statistical situations.
引用
收藏
页码:753 / 788
页数:35
相关论文
共 58 条
[1]  
Audibert J(2007)Fast learning rates for plug-in classifiers Annals of Statistics 35 608-633
[2]  
Tsybakov A(2009)Agnostic active learning Journal of Computer and System Sciences 75 78-89
[3]  
Balcan M-F(2008)Minimax bounds for active learning IEEE Transactions on Information Theory 54 2339-2353
[4]  
Beygelzimer A(1957)An optimum character recognition system using decision functions IRE Transactions on Electronic Computers 4 247-254
[5]  
Langford J(1994)Improving generalization with active learning Machine Learning 15 201-221
[6]  
Castro RM(2011)Two faces of active learning Theoretical Computer Science 412 1767-1781
[7]  
Nowak RD(2019)Consistency of plug-in confidence sets for classification in semi-supervised learning Journal of Nonparametric Statistics 32 42-72
[8]  
Chow C(1997)Selective sampling using the query by committee algorithm Machine Learning 28 133-168
[9]  
Cohn D(2016)Classification in general finite dimensional spaces with the k-nearest neighbor rule The Annals of Statistics 44 982-1009
[10]  
Atlas L(2011)Rates of convergence in active learning The Annals of Statistics 2 333-361