Constrained Clustering With Imperfect Oracles

Cited by: 15
Authors
Zhu, Xiatian [1]
Loy, Chen Change [2]
Gong, Shaogang [1]
Affiliations
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[2] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Hong Kong, Peoples R China
Keywords
Affinity propagation; constrained clustering; constraint propagation; feature selection; imperfect oracles; noisy constraints; similarity/distance measure; spectral clustering (SPClust); CLASSIFICATION; PROPAGATION;
DOI
10.1109/TNNLS.2014.2387425
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
While clustering is usually an unsupervised operation, there are circumstances where we have access to prior belief that pairs of samples should (or should not) be assigned to the same cluster. Constrained clustering aims to exploit this prior belief as constraints (or weak supervision) to influence cluster formation, so as to obtain a data structure that more closely resembles human perception. Two important issues remain open: 1) how to exploit sparse constraints effectively and 2) how to handle ill-conditioned/noisy constraints generated by imperfect oracles. In this paper, we present a novel pairwise similarity measure framework to address these issues. Specifically, in contrast to existing constrained clustering approaches that blindly rely on all features for constraint propagation, our approach searches for neighborhoods driven by discriminative feature selection for more effective constraint diffusion. Crucially, we formulate a novel approach to handling the noisy constraint problem, which has been unrealistically ignored in the constrained clustering literature. Extensive comparative results show that our method is superior to state-of-the-art constrained clustering approaches and can generally benefit existing pairwise similarity-based data clustering algorithms, such as spectral clustering and affinity propagation.
Pages: 1345-1357
Number of pages: 13