A batch-mode active learning SVM method based on semi-supervised clustering

被引:10
作者
Fu, Chun-Jiang [1 ]
Yang, Yu-Pu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Minist Educ, Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China
关键词
Active learning; semi-supervised clustering; k-medoids; cluster assumption; support vector machine; CLASSIFICATION; TUTORIAL;
D O I
10.3233/IDA-150720
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A batch-mode active learning technique taking advantage of the cluster assumption was proposed. It focused on binary classification tasks adopting SVM (support vector machine). In each active learning iteration, unlabeled instances in the SVM margin were first grouped into two clusters. Then from each cluster, points most similar to the other cluster were selected for labeling. Such points lying near the boundary between clusters were expected to become support vectors in the final classification model with high probability. The clustering process was performed in the same kernel space as SVM. With semi-supervised K-medoids, labeled instances were also used to improve the clustering performance. Experiments showed that the proposed method was efficient and robust (to poor initial samples).
引用
收藏
页码:345 / 358
页数:14
相关论文
共 35 条
[31]   Support vector machine active learning with applications to text classification [J].
Tong, S ;
Koller, D .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (01) :45-66
[32]  
Tong Simon, 2001, Support vector machine active learning for image retrieval, P107, DOI 10.1145/500141.500159
[33]   A tutorial on spectral clustering [J].
von Luxburg, Ulrike .
STATISTICS AND COMPUTING, 2007, 17 (04) :395-416
[34]  
Xu ZB, 2007, LECT NOTES COMPUT SC, V4425, P246
[35]  
Zhu Xiaojin, 2006, Semi-supervised learning literature survey, V2, P4