Active Learning Method for Constraint-Based Clustering Algorithms

被引:5
作者
Cai, Lijun [1 ]
Yu, Tinghao [1 ]
He, Tingqin [1 ]
Chen, Lei [1 ]
Lin, Meiqi [1 ]
机构
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
来源
WEB-AGE INFORMATION MANAGEMENT, PT II | 2016年 / 9659卷
关键词
Active learning; Clustering; Pairwise constraints;
D O I
10.1007/978-3-319-39958-4_25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-supervision clustering aims to improve clustering performance with the help of user-provided side information. The pairwise constraints have become one of the most studied types of side information. According to the previous studies, such constraints increase clustering performance, but the choice of constraints is critical. If the constraints are selected improperly, they may even degrade the clustering performance. In order to solve this problem, researchers proposed some learning methods to actively select most informative pairwise constraints. In this paper, we presents a new active learning method for selecting informative data set, which significantly improves both the Explore phase and the Consolidate phase of the Min-Max algorithm. Experimental results on the data set of UCI Machine Learning Repository, using MPCK-means as the underlying constraint-based semi-supervised clustering algorithm, show that the proposed algorithm has better performance.
引用
收藏
页码:319 / 329
页数:11
相关论文
共 21 条
[1]  
Basu S, 2004, SIAM PROC S, P333
[2]  
Blake C., 1998, UCI REPOSITORY MACHI
[3]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[4]  
Davidson I, 2006, LECT NOTES ARTIF INT, V4213, P115
[5]   Probabilistic characterization of nearest neighbor classifier [J].
Dhurandhar, Amit ;
Dobra, Alin .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2013, 4 (04) :259-272
[6]  
Greene D, 2007, LECT NOTES ARTIF INT, V4701, P140
[7]  
Guo Y., 2007, P 20 INT C NEURAL IN, P593
[8]  
Hoi S.C.H., 2008, 2013 IEEE C COMP VIS, P1
[9]   Semi-supervised document clustering via active learning with pairwise constraints [J].
Huang, Ruizhang ;
Lam, Wai .
ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, :517-522
[10]  
Kaufman L., 1990, J AM STAT ASSOC, V86, P830