Active learning with support vector machines

被引:83
作者
Kremer, Jan [1 ]
Pedersen, Kim Steenstrup [1 ]
Igel, Christian [1 ]
机构
[1] Univ Copenhagen, Dept Comp Sci, Copenhagen, Denmark
关键词
CLASSIFICATION; ONLINE;
D O I
10.1002/widm.1132
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In machine learning, active learning refers to algorithms that autonomously select the data points from which they will learn. There are many data mining applications in which large amounts of unlabeled data are readily available, but labels (e. g., human annotations or results coming from complex experiments) are costly to obtain. In such scenarios, an active learning algorithm aims at identifying data points that, if labeled and used for training, would most improve the learned model. Labels are then obtained only for the most promising data points. This speeds up learning and reduces labeling costs. Support vector machine (SVM) classifiers are particularly well-suited for active learning due to their convenient mathematical properties. They perform linear classification, typically in a kernel-induced feature space, which makes expressing the distance of a data point from the decision boundary straightforward. Furthermore, heuristics can efficiently help estimate how strongly learning from a data point influences the current model. This information can be used to actively select training samples. After a brief introduction to the active learning problem, we discuss different query strategies for selecting informative data points and review how these strategies give rise to different variants of active learning with SVMs. (C) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:313 / 326
页数:14
相关论文
共 50 条
[1]  
[Anonymous], 2012, AS C MACH LEARN
[2]  
[Anonymous], P SIGIR C RES DEV IN
[3]  
[Anonymous], 2004, KERNEL METHODS PATTE
[4]  
[Anonymous], 2006, BOOK REV IEEE T NEUR
[5]  
[Anonymous], 2009, P 26 ANN INT C MACH
[6]  
[Anonymous], P SPIE
[7]  
[Anonymous], 2001, Active learning: theory and applications
[8]  
[Anonymous], 2008, IEEE C COMP VIS PATT
[9]  
[Anonymous], J MACH LEARN RES
[10]  
[Anonymous], 2012, Synthesis Lectures on Artificial Intelligence and Machine Learning