Active Image Clustering with Pairwise Constraints from Humans

被引:22
作者
Biswas, Arijit [1 ]
Jacobs, David [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
Clustering; Active learning; Human in the loop; Pairwise constraints; Image labeling;
D O I
10.1007/s11263-013-0680-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a method of clustering images that combines algorithmic and human input. An algorithm provides us with pairwise image similarities. We then actively obtain selected, more accurate pairwise similarities from humans. A novel method is developed to choose the most useful pairs to show a person, obtaining constraints that improve clustering. In a clustering assignment, elements in each data pair are either in the same cluster or in different clusters. We simulate inverting these pairwise relations and see how that affects the overall clustering. We choose a pair that maximizes the expected change in the clustering. The proposed algorithm has high time complexity, so we also propose a version of this algorithm that is much faster and exactly replicates our original algorithm. We further improve run-time by adding two heuristics, and show that these do not significantly impact the effectiveness of our method. We have run experiments in three different domains, namely leaf, face and scene images, and show that the proposed method improves clustering performance significantly.
引用
收藏
页码:133 / 147
页数:15
相关论文
共 41 条
[1]  
Angluin D., 1988, Machine Learning, V2, P319, DOI 10.1023/A:1022821128753
[2]  
Basu S., 2008, DATA MINING KNOWLEDG
[3]  
Basu S, 2002, MACHINE LEARNING, P19
[4]  
Basu S., 2004, 4 SIAM INT C DAT MIN
[5]  
Biswas A., 2011, INT C MACH LEARN 201
[6]  
Biswas A, 2012, PROC CVPR IEEE, P2152, DOI 10.1109/CVPR.2012.6247922
[7]  
Branson S, 2011, IEEE I CONF COMP VIS, P1832, DOI 10.1109/ICCV.2011.6126450
[8]  
Branson S, 2010, LECT NOTES COMPUT SC, V6314, P438, DOI 10.1007/978-3-642-15561-1_32
[9]  
Chellappa R., 1993, MARKOV RANDOM FIELDS
[10]  
Dagli CK, 2006, INT C PATT RECOG, P506